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Evaluation of Adaptive Speech Coders Under 
Noisy Channel Conditions 


By C. SCAGLIOLA 
(Manuscript received June 20, 1978) 


An experiment has been performed in which the digital transmis- 
ston of speech coded by adaptive differential pcm was simulated 
under noisy channel conditions. The experiment was done with two 
aims: (i) to get information on the subjective effect of channel errors 
and the influence of various design parameters on the speech quality 
under various conditions and (ii) to find objective measures for 
predicting the overall quality of the processed speech over a wide 
range of circuit conditions. The subjective results show that, for a 
speech transmission through a channel with bit error probability up 
to 1/256, best results can be obtained with a slow error recovery, 
associated with fast quantizer adaptation. The use of slow error 
recovery and slow quantizer adaptation is preferable for channels 
with very high bit error rates, like 1/32. Overall subjective quality is 
well predicted by the sum of two terms: (i) an objective performance 
measure of the noise present on the output signal, disregarding any 
effect of level mismatching due to the sensitivity of the adaptation 
algorithms to channel errors and (ii) a measure of the level mismatch- 
ing which takes into account both the average gain on the output 
signal and its fluctuation in time. The best prediction scores are 
achieved by three newly defined objective performance measures, two- 
level compensated segmental snRs, and a spectral signal-to-distortion 
ratio. 


|. INTRODUCTION 

The design of digital waveform coders for speech communications 
must face the inevitable presence of channel errors. Adaptive coders, 
like ADPCM (adaptive differential pcm), in which the adaptation of the 
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quantizer step-size is derived from the transmitted binary stream and 
no error-protected side information is sent to the receiver, may be 
particularly sensitive to this problem. In fact, a single channel error 
may cause a multiplicative offset between the signal level at the . 
receiver and that at the transmitter. This offset may persist indefinitely 
if no error dissipation mechanism is provided. 

Recently, some algorithms of quantizer adaptation have been devel- 
oped that make the effect of a single transmission error die out over 
time, so the transmitter and receiver can resynchronize their step-size 
estimates.’” The possibility of obtaining such results is physically due 
to the fact that these algorithms have an imperfect adaptation: the 
step size increases more quickly and decreases more slowly for low 
input levels than for high ones. In this way, the step size is overesti- 
mated for low input levels and underestimated for high ones, thereby 
reducing the dynamic range of the coder. Dynamic range and error 
dissipation rate vary inversely, and the designer has to balance between 
them. 

In the case of speech transmission, the choice of the appropriate 
design parameters must be based on a precise evaluation of the 
subjective quality of the coded speech. Use of the conventional long- 
term signal-to-noise ratio as an estimator of the subjective quality 
would be, in this instance, completely misleading, at least because an 
offset in the signal amplitude between input and output, due to an 
offset in step sizes caused by an error, will be reflected in a noticeable 
squared difference between the two waveforms, while it may not be 
subjectively disturbing. 

To study the subjective performance of ADPCM coders operating 
under both error-free and noisy channel conditions, an experiment has 
been conducted, as summarized in Section II. The following three 
sections provide a brief description of the coding method, the definition 
of several objective performance measures, and the description of the 
experimental design and testing procedure. Sections VI and VII pro- 
vide analyses of subjective and objective measurement data. In Section 
VIII, the results of the previous two sections are discussed and a 
physical interpretation is given of the principal findings on quality 
prediction. 


ll. OVERVIEW OF THE EXPERIMENT 

The experiment included 12 different ADPCM coding schemes, which 
comprised all combinations of two bit rates, two adaptation time 
constants, and three error dissipation rates. These systems processed 
a total of 288 speech samples from four talkers (two male and two 
female), at two different power levels (24 dB apart) and with three 
different probabilities of independent errors on the channel. 

Twenty listeners rated the quality of the processed speech samples 
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on a scale from 1 to 9. The odd values were associated with the 
adjectives: unsatisfactory, poor, fair, good, excellent. In addition to the 
subjective data, a fairly large number of objective performance mea- 
sures were also taken on the processed speech samples. 

The aims of the experiment included the study of: 


(t) The influence on speech quality of the above design param- 
eters. 

(iz) The optimum combination of parameters for a given error 
probability. 

(iii) The objective measures or combinations of objective mea- 
sures which are good predictors of speech quality even under 
noisy channel conditions. 


The principal conclusions drawn from the analyses of the subjective 
and objective data are: 

(t) Contrary to a common feeling, a very slow error dissipation 
is sufficient to ensure good robustness of the ADPCM coder to 
error rates even much higher than those encountered in a 
normal telephone connection. When no recovery mechanism 
is provided, a fairly slow adaptation makes the system not 
very sensitive to channel errors in the range of error rates 
typical of a telephone connection. 

(tt) For speech transmission through a channel with bit error 
rate up to 1/256, best results can be obtained with a very 
slow error dissipation, associated with fast quantizer adap- 
tation; when the slow error dissipation is associated instead 
with a slow adaptation, the system becomes fairly robust to 
very high error rates, like 1/32, at the expense of a slight 
quality deterioration at low error rates. 

(uit) Good predictors of subjective quality were found to be two- 
segmental SNR measures in which a compensation of the 
level mismatching between input and output was performed 
on a frame-by-frame basis. The combination of any of these 
measures with two separate measures of level mismatching 
further improved the prediction accuracy. 


lit: ROBUST ADPCM SYSTEM: A BRIEF DESCRIPTION 

Figure 1 is a block diagram of the appcm coder-decoder used in the 
experiment. The predictor is a second-order transversal filter, with tap 
coefficients 1 and —0.5. The step size A(k) is adapted according to the 
robust algorithm described in Ref. 1, which permits synchronizing the 
step-size estimates at transmitter and receiver, after a transmission 
error occurs, during a period of error-free transmission 


A(k + 1) = A%(k) - M(I(k)), (1) 
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Fig. 1—Block diagram of ADPCM coder-decoder. 


where the exponent £, 0 < B < 1, is the decay constant and M(J(R)) is 
the step-size multiplier at time k. The multiplier M(J(k)) depends only 
on the actual code word J(k) and assumes N = 227! distinct values 
{M,, Mo, --- Mn}, where B is the number of bits used to encode the 
prediction error e(h). 

The decay speed is shown to be independent of the actual code word 
transmitted and also of the values of the multipliers, being only a 
function of the decay constant 8.’ When B = 1, the decay speed is zero 
and the decay time infinite. With 6 < 1, the decay speed increases, but 
at the same time the dynamic range decreases. 

It has been shown that the loading factor (ratio between range of 
the quantizer and rms quantizer input) is a constant as a function of 
the input level if 8 = 1, but it is a decreasing function of the input level 
if 8B < 1.' The form of their relationship is almost linear, with slope 
approximately inversely proportional to log(Mn/M,)/(1 — £), as indi- 
cated by Fig. 6 of Ref. 1 and as recently proved theoretically by D. 
Mitra.’ Therefore, with a small 6 and a small ratio between the 
maximum and minimum multipliers, the coder will produce more 
granular noise for low input level and more overload distortion for high 
levels. 

The values of the multipliers play another important role in the 
overall performance of the coders under noisy channel conditions. In 
fact, they determine the magnitude of the initial offset after a single 
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transmission error. In the worst case, the offset is given by 


A‘(k)|  _ My 
A(k) ~ M, 


max 


(2) 





In the present experiment, the multipliers M; were related by the 
linear relationship: 


M; = [a + C(1 — a) (i — 0.5) JAC, (3) 


where A is the step size that gives optimum performance at the desired 
nominal input level. This was fixed at —21 dBm, i.e., 27 dB under the 
saturation threshold for the signal that in the internal 16-bit computer 
representation is 32767. This relationship was chosen because, for B 
= 1, the adaptation algorithm coincides with the magnitude estimation 
algorithm described by Castellino et al.,* and for this algorithm more 
information about the subjective effects of its parameters are avail- 
able.” In eq. (3), C essentially determines the mixture of granular noise 
and clipping distortion in the decoded prediction error at the nominal 
level. The parameter a controls mainly the speed of adaptation and 
hence the ratio between maximum and minimum multipliers. 


IV. OBJECTIVE PERFORMANCE MEASURES 


Several objective performance indices were measured for each ut- 
terance in the experiment. The speech samples used as input to the 
coders were low-pass filtered at 3.4 kHz before being sampled and 
converted into digital form by a 16-bit A/D converter operating at 8- 
kHz sampling rate. Figure 2 is a block diagram of the simulated circuit 
arrangement for performing coding and measurements. A filter after 
the ADPcM decoding limits the bandwidth of the output speech as in a 
real situation. A secondary path provides the reference signal so(k) 
with which the filtered output ro(k) is compared to compute the 
objective performance measures. With two identical filters in the main 


INPUT SPEECH OUTPUT SPEECH 


‘(Ky appem | 
CHANNEL ee cuEa FILTER 
COMPARE 
FILTER 
So(k) 


REFERENCE 
Fig. 2—Block diagram of simulated circuit arrangement for coding and measurements. 
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and reference paths, only the distortion introduced by the coder is 
measured. The filters are 5th-order elliptic low-pass with 3.4-kHz 
cutoff frequency, 0.25-dB in-band ripple, and at least 40 dB stopband 
attenuation. 

The measures are classified in the two categories of time domain 
measures and frequency domain measures. Frequency-weighted signal- 
to-noise ratios are included in the first category because they rely 
strongly on the exact time synchronization of the two waveforms and 
on the absence of phase distortion. 


4.1 Time domain measures 
4.1.1 Long-term signal-to-noise ratio (SNR) 


Y. sc(k) 


k 
» [so(k) — ro(k)]}°’ = 


SNR = 10 log 


where & ranges over all the samples of the utterance. SNR is the ratio 
between the long-term signal energy and the long-term noise energy, 
the noise being defined as the difference between reference signal so(k) 
and output signal 7o(z). 


4.1.2 Segmental signal-to-noise ratio (SNR.,.g) 


Here the utterance is divided into adjacent segments of J samples 
each, and the signal-to-noise ratio in each segment is measured in 
decibels. The noise is still defined as the difference between corre- 
sponding samples of reference and output speech. The segmental SNR 
is the average of these measures over the M segments of the utterance. 


J 
1 M-1 py soU + mJ) 
SNReeg = 57D 10 log ——_—_~________—___ (5) 
ee Y [so + mJ) — roi + mJ) 7 


J=1 


In this experiment, J = 128, corresponding to 16 ms segments. This 
measure, proposed by Noll,® was recently found to correlate very nicely 
with subjective ratings of ADPCM-coded speech.” A very important 
feature added to this basic formula consists in discarding from the 
computation those segments in which the signal power is below —54 
dBm. This threshold, whose value was found to be appropriate for 
high quality speech,’ was introduced to avoid a slight idle channel 
noise having an unduly great negative weight in the overall perform- 
ance measure. This is done also in all the following time domain 
measures. 
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4.1.3 Compensated signal-to-noise ratio (SNReom) 


This measure was specifically formulated to compensate for the 
level variations that may occur under channel error conditions when 
the coder has a slow error dissipation. The difference between reference 
and output signals due to level offset should not be measured in fact 
as nolse. 

To compensate for these level variations, let us formulate the coding 
process in the mth segment as composed of an amplification of the 
input signal, the addition of an uncorrelated random noise, and a 
possible de component. Therefore, the output process in the mth 
segment can be written as 


rok) = g(m)so(k) + q(k) + Q. (6) 


This coincides with the simple linear regression model of ro(k) on 
So(k). Therefore, the gain factor g(m) is the slope of the regression line 
of the ouput on the reference signal in the mth segment, and the noise 
term q(k) is the minimum error made in predicting ro(k) from so(k).° 

Let us define the signal-to-noise ratio in the mth segment as the 
ratio between the variances of g(m)-so(k) and q(k). This can be shown 
to be only a function of the correlation coefficient p(m) between the 
reference and output signals in the mth segment:® 

p(m) = —_ ABsr(m) = Slm)Sm) ; (7) 


where S,(m) indicates the summation of x(j) and S,,(m) the summa- 
tion of x(7)-y(7) over the J samples of the mth segment. 
Averaging in decibels across the M segments in the utterance, the 
compensated SNR turns out to be: 
1 M-1 p°(m) 


SNR.wom = = 10 log ——_,-——-.. 
Mee = 


(8) 


4.1.4 Average gain (G) 


This is the average in decibels of the gain factor g(m) defined before, 
across the M segments. From simple linear regression analysis, the 
gain g(m) is:® 
JSs-(m) a S,(m)S,(m) 


g(m) = 


The average gain, which is an indication of how much the output 
level was increased or decreased on the average, with respect to the 
input level, is: 

M-1 


1 
G= Vi 2 20 log g(m). (10) 
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4.1.5 Gain fluctuation (o,) 


This is simply the standard deviation of the gain g(m), measured in 
decibels, across the M segments. It is a measure of how much the 
output level fluctuates owing to transmission errors. 


| Mo 1/2 
Og = FE yi (20 log g(m))? — a : (11) 


4.1.6 Maximum signal-to-noise ratio (SNR max) 


An alternate way of compensating for the level mismatching between 
reference and output speech signals was found by defining the noise in 
the mth segments as 


e(k) = s.(k) — S0(k), (12) 
where 
§0(k) = ay(m) a ro(k) + Ao (13) 


is the least-square estimate of so(k) based on 7o(k). 

The ratio between the variances of the reference signal and the 
minimum estimation error e(k) (hence the name “maximum SNR’”’) is 
again only a function of the correlation coefficient p(m), defined by 
eq. (7). 

Averaging again in decibels across the M segments in the utterance, 
the maximum SNR is 


M-1 


1 
NRmax = >> 10 l 14 
SNR iu 4, og (14) 


1 — p*(m) 

It is readily seen that SNRmax iS always greater than SNRcom and that 
the two measures give essentially different results only for low-quality 
coding conditions. 


4.1.7 Frequency-weighted, segmental, signal-to-noise ratios 


This term indicates a fairly large class of measures. In these mea- 
sures, the frequency axis is partitioned into many bands, usually 
nonuniform, the reference and output spectra are compared, some 
performance measure is then computed over each band, and these 
measures are averaged across the bands. In the measures described 
below, the spectra are computed over 256 points (32 ms). The segmen- 
tal measures are obtained by averaging the measures taken every 128 
samples (16 ms). 
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Three measures are reported here. They are described more in Refs. 
7 and 9. The partitioning of the frequency axis is effected in those 
three cases according to the 16 classical articulation bands.’ 














= 1 M-1 1 16 oh 
SNRF| = Vi 2 Fe 10 log aa (15) 
S;(m) 
>) L;(m) 10 log 
M—- 
SNRF2 = at ive ps iia ) (16) 
M n=0 ,) L; (m) 
1 a 
5 = — 10 log ———_—_—_——_- 17 
SNRF De og Ny, Gn)’ (17) 
j=1 Sj(m) 


where S;(m) is the energy of the reference signal so(k) in the jth 
frequency band in the frame m and N,(m) is the corresponding noise 
energy. The noise is again defined as the difference between reference 
and output signals, the latter being preventively divided by the gain 
g(m) previously defined to compensate for level fluctuations. However 
g(m) is computed on the 256-point analysis window. In eq. (16), a 
“loudness weighting” has been introduced. The term L;(m) corre- 
sponds approximately to the subjective loudness in band J, and is 
computed as 


L;(m) = | | S-(f) |)? df, (18) 
band / 


where S,(f) is the spectrum of the gain-compensated output speech 
ro(k)/g(m). 


4,2 Frequency domain measures 


All the spectral measures here presented and used in the experiment 
are based on the concept of linear prediction or inverse filtering.'' The 
speech signal is represented by the pth order autoregressive model: 


s(k) = ) ais(k — 1) + u(k), (19) 
i=l 


where u(k) is the white spectrum excitation function and the a;’s are 
the coefficients of the inverse filter 


A,(z) =1—- > az. (20) 
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The coefficients a;’s are computed to minimize the residual power of 
the signal at the inverse filter output. 

In this paper, the dissimilarity between the spectra of reference and 
output speech in a given frame is computed essentially by comparing 
the residual powers of the signals so(#) and ro(k) filtered by the inverse 
filters A,(z) and A,(z), derived from the same two signals. Four residual 
powers can be computed in the mth signal frame: 


(t) P.(m) obtained passing so(k) through A,(z). 
(tt) Pa(m) obtained passing ro(k) through A,(z). 
(tit) Pe(m) obtained passing 7ro(k) through A,(z). 
(tv) Pa(m) obtained passing so(k) through A,(z). 


Four objective measures based on these concepts are presented in 
the following paragraphs. 


4.2.1 LPC distance measure (D,) 


This measure, proposed by Itakura,” is also called log likelihood 
ratio. The distance between output and reference speech in the mth 
frame is defined as 


Palm) 
P.(m) 





D, = In (21) 

It can be shown that D, can be expressed in terms of spectral 
differences between the Lec models of the two frames of speech.” 
Moreover, it results that the spectral difference is most heavily 
weighted in the peaks of the input speech smoothed spectrum, 1.e., in 
the speech formants. 

Interchanging the roles of reference and output pee a different 
log likelihood ratio is obtained: 


Pa (m) 


De =lIn P.(m) ’ 


(22) 





which has the same basic properties as D,. 
The measure used here is actually the arithmetic mean of D, and 
De, averaged across the M segments of the utterance: 








_ 117 Pan) | Pe(m) 
Dre i Se | (23) 


4.2.2 Bharucha index (Dg) 


This index is again a distance measure, similar to the log likelihood 
ratio. It has been formulated by Bharucha”™ and its definition is also 
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reported in Ref. 7. The basic idea is that of measuring the noise 
introduced by the coder by “notching out” the speech spectrum by 
means of a time-varying linear filter, whose transfer function is 
matched to the inverse of the short-term spectral envelope. The quality 
index proposed by Bharucha is essentially the average increase in the 
residual power at the output of the “notch filter,” due to coding: 


1 ae en 








Daag b, 1018 | Bim) * Pm) is 
where P,(m) and P,(m) are the powers of the reference and output 
speech in the mth segment and provide the appropriate scaling in the 
measure. It can be shown that an uncorrelated noise component in the 
output speech is inversely weighted, frame by frame, by the smoothed 
LPC spectrum of the input speech signal.”"* Therefore, the noise has 
more weight in those frequency bands where the signal energy is low; 
this is probably in conformity with subjective noise evaluation. 


4.2.3 Spectral signal-to-distortion ratios 


Following the same basic idea of the Bharucha index, two other 
measures were derived in the form of signal-to-distortion ratios. In 
fact, they are measured in decibels, and they increase with increasing 
quality, like the time domain snrs. With the first spR, a distortion 
power is defined as the difference between the prediction error powers 
P.(m) and P.(m) defined above. 

Before taking the difference, however, the term Pa(m) is multiplied 
by P.(m)/P,-(m) to compensate for level differences between input 
and output. 


Jl MoI P,(m) 
SDR) = u Py 10 log Pam) -1PAm)/Pim))2 Pam) [P.Am)/PAm)]=PAm)’ (25) 


In the second spR measure, the difference between the signal-to- 
prediction error ratios in decibels that is averaged to give Dz is instead 
computed relative to the input signal-to-prediction error ratio, and this 
new ratio is again averaged in decibels across the segments of the 
utterance. 


wala 10 log[ P;(m)/P.(m)] 
SDRE = 72,” 8 to leg Pim) Ban) (Pana 


In this experiment, Dz, SDR, and sDR2 are computed with an analysis 
window of 160 samples (20 ms) that was shifted by 128 samples (16 
ms) every frame. The inverse filters for computing P.(m) and Pa(m) 
were always of the 20th order. 
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V. EXPERIMENTAL DESIGN AND PROCEDURE 
5.1 Circuit conditions 


The choice of the experimental design variables was dictated by the 
criterion of a broad quality range and of each value the variables 
assumed having caused an effect that could be perceived for at least 
some combination of the other variables. 

Two bit/sample values were chosen, B = 3 and B = 4, and three 
values of the decay constant, 6 = 1, B = 255/256, and B = 63/64. 

The condition £ = 1 was included because it should give the broadest 
dynamic range and because it makes the ADPCM system identical to 
the earlier schemes.*” 

For the step-size multipliers, given by formula (3), C was kept 
constant for each bit rate and equal to the values 0.65 and 0.41 that 
were found to be subjectively optimum for B = 3 and B = 4, respec- 
tively.” The parameter a was given two values, 0.75 and 0.96875. The 
first value produces a rather fast adaptation, with a corresponding 
time constant of 0.5 ms. With a = 0.75, the ratio Mn/M, is about 4 dB 
and 5.5 dB for B = 3 and B = 4, respectively. The second value of a 
produces a rather slow adaptation, the corresponding time constant 
being 4 ms. The ratio My/M, is much smaller, about 0.5 dB and 0.7 dB 
for 3- and 4-bit/sample, respectively. 

Two different input levels, 24 dB apart, were used for every combi- 
nation of the other conditions. They were L = —33 dBm and L = —9 
dBm, i.e., symmetrical around the nominal input level —21 dBm for 
which the coder was designed to have optimum performance. The 
level —21 dBm was not included to keep the dimension of the experi- 
ment within reasonable limits of feasibility. 

Finally the channel was characterized by three different probabilities 
of independent errors, P(e) = 0, P(e) = 1/256, P(e) = 1/82. 

Summarizing, the experiment included 72 conditions which com- 
prised all the combinations of B = 3, 4; a = 0.75, 0.96875; B = 1,255/ 
256, 63/64; L = —33, —9; P(e) = 0, 1/256, 1/32. 


5.2 Preparation of stimuli 

Each experimental condition was simulated four times, using as 
input signals four sentences spoken by two male and two female 
talkers. Each talker spoke into a high-quality dynamic microphone, 
while seated in a sound-proof booth. The digital recordings had been 
generated by low-pass filtering the amplified microphone signal at 3.4 
kHz, and then sampling and converting it into digital form by a 16-bit 
A/D converter operating at 8-kHz sampling rate. 

For simulating the different input levels, the sentences, all previously 
adjusted to the same mean power level of —21 dBm, were multiplied 
by a constant factor at the coder input, and then divided by the same 
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factor at the output. In this way, each processed sentence was listened 
at the same level, unless channel errors and/or slope overload of the 
coder had caused output level variations. 

From the 288 simulations, two analog test tapes were generated, 
each containing in a different random order two simulations of each 
experimental condition, one with a male and one with a female talker. 
For each talker, 18 different sentences read from a different phoneti- 
cally balanced list were used, so that in each tape the same sentence 
appeared only twice. 


5.3 Testing procedure 


Twenty paid subjects (10 for each tape), all students from junior 
and senior classes of local high schools, judged the 288 stimuli. They 
listened to the processed speech binaurally over Pioneer SE 700 
earphones at a nominal level of 80 dBspz, while seated in a double- 
walled sound booth. As pointed out before, the level of individual 
sentences varied according to the particular experimental conditions. 
The total listening time for each group of subjects was about 30 
minutes, with a short break after the 80th sentence. After each stim- 
ulus, the subjects had 4 seconds to record their judgments. They were 
asked to rate the quality of the stimuli according to the adjectives: 
excellent, good, fair, poor, unsatisfactory. Their answer sheet contained 
144 rows of short lines divided into nine columns, with the odd ones 
labeled with the adjectives. In this way, the subjects were allowed to 
check intermediate ratings, if they chose to do so. 

The categorical judgments expressed by the listeners were subse- 
quently converted into numerical scores, assigning value 1 to the 
category “unsatisfactory,” value 9 to the category “excellent,” and 
intermediate integer values to intermediate categories. 

Before the actual test sessions took place, the subjects listened to 12 
practice sentences different from those used in the experiment, spoken 
by the same four talkers, and representative of the range of quality 
they expected in the test. 


Vil. ANALYSIS OF SUBJECTIVE RESULTS 
6.1 Control variables 


The purpose of the subjective test was to assess the different 
behavior of coders in the presence of different operating conditions, 
namely, channel error rate and input level. Other sources of variability 
in listener responses are expected to cancel out in the average data for 
each experimental condition. Before averaging the data for each ex- 
perimental condition, it was necessary to assess the importance of such 
extraneous sources of variability, as differences in the way the listeners 
judged the stimuli and differences due to talker voices. 
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6.1.1 Listeners 


To assess the variability due to listener differences, their responses 
were analyzed according to MDPREF.'*!’ This is a factor analytic 
procedure that derives a geometrical multidimensional space represen- 
tation, in which the stimuli are represented by points and the subjects 
by vectors. The projections of the points on a vector are the best fit 
with the scores given to the stimuli by that subject. Basically, MDPREF 
reveals whether the subjects attended to different psychological attri- 
butes in the stimuli or if they attached different weights to each of the 
various attributes. In the solution for the 72 experimental conditions 
and the 20 subjects, the first principal component accounted for only 
55 percent of the variance, while the remaining 45 percent was distrib- 
uted over all the other components: 4.2 percent for the 2nd, 3.8 percent 
for the 3rd, 3.4 percent for the 4th, 3 percent for the 5th, etc. 

The fact that 45 percent of the total variance was accounted for by 
sO many axes in an almost uniform fashion indicates that these axes 
do not represent different perceptual attributes of the stimuli, but that 
they account only for the “noise” in the subjective data. In other 
words, in spite of the low variance accounted for by the first axis, it 1s 
evident that the listeners attended essentially to the same attributes 
with the same weights and then only a unidimensional solution exists. 
Therefore, the mean of the listeners’ ratings for each condition were 
used for the subsequent analyses. 


6.1.2 Talkers 


An analysis of variance was computed to study the variability of the 
scores obtained by the talkers of different sex and to assess the validity 
of averaging the ratings across talkers to perform an analysis of the 
effect of the design variables. The analysis showed that the difference 
due to the sex of the talker was highly significant. The average score 
was 4.40 for female talkers and 5.07 for male talkers. On the other side, 
however, all the interactions between the sex of the talker and the 
design variables were not significant. This indicates that the sex of the 
talker influenced the average value of the ratings, but not the relative 
ranking of the various experimental conditions. Therefore, the mean 
ratings across listeners and talkers were used for further analyses, 
reducing the variability of the data to that due to the physical variables 
of the coders and the circuits. 


._ 6.2 Design variables 


In Figs. 3 and 4, the mean ratings across listeners and talkers are 
shown for each bit rate B and decay constant £ as a function of the 
probability of error P(e), with a and the level L as parameters. 
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When no error dissipation mechanism is provided (Figs. 3a and 4a), 
a slower adaptation, i.e., a = 0.96875, makes the system less sensitive 
to the errors. With a slow dissipation, i.e., B = 255/256 (Figs. 3b and 
4b), the slow adaptation is advantageous only at the higher bit error 
rate, while with no errors the performance appears to be worse than 
with fast adaptation. With faster error dissipation, i.e., 8 = 63/64 (Figs. 
3c and 4c) and slow adaptation, the unbalancing of the load factor 
between low and high input level is very high and the performance at 
the low level is very degraded even with no errors. With fast adaptation, 
i.e., a = 0.75, the dynamic range is instead very high; besides, even if 
the performance under error-free conditions is lower than with slower 
error dissipation, the system is very insensitive to channel errors. 
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g. 3—Mean ratings as a function of probability of error, for B = 3: (a) B = 1. 
(b) ae 255/256. (c) B = 63/64 
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Fig. 4—Mean ratings as a function of probability of error, for B = 4: (a) B = 1. 
(b) Be = 255/256. (c) B = 63/64. 


6.2.1 Analysis of variance 


A five-way analysis of variance was performed to evaluate the effect 
of the experimental variables. The results of the analysis are reported 
in Table I. The last column reports the P value, that is, the probability 
that the test statistics assume a value greater than or equal to the 
computed F ratio, under the null hypothesis, compared with the two 
significance levels 0.05 and 0.01. 

The analysis showed that all the main effects except that due to a 
are highly significant. The fact that a has no significant effect means 
that it has a positive effect for certain combinations of parameters and 
a negative one for others, as shown by the significant interactions. 

The interactions between B and a and between B and B were found 
to be not significant, indicating that a and B have the same effects on 
the quality of coded speech whether it is a three-bit or a four-bit one. 
All the other two-way interactions are highly significant. Only the 
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Table I[—Analysis of variance of the mean scores across listeners 


and talkers 
Degrees of Sum of Mean F 
Source Freedom Squares Squares Ratio Significance 

B 1 31.8928 31.8928 204.7 P<0.01 
a 1 0.1733 0.1733 1.11 >0.05 
B 2 4.9167 2.4583 15.77 <0.01 
L 1 19.1570 19.1570 122.96 <0.01 
P(e) 2 85.5395 43.7697 274.52 <0.01 
Bxua 1 0.2854 0.2854 1.84 >0.05 
BxB 2 0.0034 0.0017 0.01 NS 
BxL 1 0.7642 0.7642 4.93 <0.01 
B x P(e) 2 1.8088 0.9044 5.84 <0.01 
axB 2 11.0713 5.5356 35.78 <0.01 
axl 1 3.6933 3.6933 23.87 <0.01 
ax P(e) 2 5.0383 2.5191 16.28 <0.01 
BxL 2 3.8310 1.9155 12.38 <0.01 
Bx Ple) 4 4.7084 1.1772 7.60 <0.01 
L x P(e) 2 1.5342 0.7671 4.95 <0.05 
axBpxLl 2 6.2810 3.1405 20.16 <0.01 
Residual 43 6.9980 0.1558 


interaction between L and P(e) is significant at P < 0.05 but not at P 
< 0.01. This indicates that the level has an effect almost independent 
of the probability of error. 

The significant interaction between B and L is due to the fact that 
the difference between the ratings at the two levels is greater on the 
average for B = 4 than for B = 3. The significant interaction between 
B and P(e) is instead due to the fact that the coder with higher bit 
rate has a greater loss in quality in passing from P(e) = 0 to P(e) = 
1/382. 

Of the three-way interactions, only that among a, 8, and L is 
significant. All the other three- and four-way interactions were not 
significant, and they were pooled in the residual. 


Vil. QUALITY PREDICTION BY OBJECTIVE MEASURES 


To find an objective predictor of the speech quality, linear regression 
procedures were used. A linear model was chosen not only for its 
simplicity, but also because in many cases it proved to be adequate to 
represent the relationship between objective measures and subjective 
quality. A linear relationship exists, for instance, between the simple 
SNR and the quality of speech degraded only by the addition of 
stationary random noise or of speech dependent noise.'*”’ A linear 
relationship exists also between signal-to-granular noise ratio and 
probability of overload, and the quality of speech processed by ADPCM 
coders when no transmission errors are present.” 

To perform regression analyses, the subjective ratings were averaged 
across listeners and talkers, and the objective measures were also 


ADAPTIVE SPEECH CODERS~ 1385 


averaged, taking the arithmetic mean of the values obtained for each 
processed sentence. The gain fluctuation o, was instead averaged 
quadratically, taking the square root of the arithmetic mean of the 
squared values a3. 

Different sets of regression formulas were computed, in which the 
objective performance measures, like signal-to-noise ratios or spectral 
distance measures, were used either singly or in combination with the 
two measures of level mismatching. Figures 5 and 6 show the gain 
fluctuation and the average gain, both averaged across bit rate, as a 
function of the probability of error. A few remarks should be made on 
these figures. Although the two sets of measures have a fairly low 
correlation of 0.55, the patterns are much alike for low input level and 
fast adaptation. For the 18 conditions with L = —33 dBm and a = 0.75, 
the correlation between G and 4g, is, in fact, 0.96. Therefore, even if in 
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Fig. 5—Gain fluctuation, averaged across bit rate, as a function of probability of error: 
(a) B = 1. (b) 8 = 255/256. (c) B = 63/64. 
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Fig. 6—Average gain, averaged across bit rate, as a function of probability of error: 
(a) B = 1. (b) B = 255/256. (c) B = 63/64. 
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a general case the two measures are independent each of the other, in 
this particular experiment a high value of o, is normally associated 
with a high average gain. This is particularly true if the input level is 
low, because, owing to the channel errors, the output level tends to be 
increased. If the input level is high and the quantizer step size is close 
to its maximum value, the output level is likely to increase only to a 
smaller extent. 

The form of the relationship between the two level mismatching 
measures and the loss in quality due to the sensitivity of the adaptation 
algorithms to channel errors was not clear a priori, and therefore 
various nonlinear transformations were tried on those measurement 
data. No transformation on gain fluctuation proved useful in regression 
equations, while a compression of the average gain, given by 


G=G/v\ GI, (27) 


gave better predictions than G, when associated with the other per- 
formance measures. 
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Table II reports the results of the regression analysis. The prediction 
accuracy is indicated by both the correlation coefficient between the 
true and predicted subjective scores and the rms prediction error. 
After each regression analysis, however, a goodness-of-fit test was 
performed to test normality of prediction errors. The Kolmogorov- 
Smirnov test® was used and in each case the hypothesis of normal 
distribution was accepted at the 0.20 significance level. 


Table ll—Formulas for predicting ratings using objective measures 


Formula for Predicting Rating 


1 R= 0.156 snr + 2.702 

2 R = 0.247 SNReeg + 1.369 

3 R = 0.271 SNRecg + 0.260G + 1.071 

4 R = 0.274 SNReeg + 0.1100, + 0.767 

5 R = 0.302 SNReeg + 0.270G + 0.1220, + 0.393 
6 R = 0.336 SNReom — 0.486 

7 R= 0.316 SNReom — 0.394G — 0.228 

8 R = 0.292 SNReom — 0.1790, + 0.569 


9 R = 0.297 SNRiom — 0.337G — 0.0880, + 0.254 
10 R = 0.389 SNRmax — 1.529 


11 R = 0.367 SNRmax — 0.335G — 1.212 

12 R = 0.343 SNRmax — 0.1600, — 0.449 

13 R = 0.345 SNRmax — 0.283G — 0.0880, — 0.668 
14 R = 0.267 snrF; + 3.906 

15 R = 0.251 SNRF, — 0.295G + 3.920 

16 R = 0.219 SNRF, — 0.2680, + 4.606 

17 R = 0.216 snRF, — 0.150G — 0.2350, + 4.527 
18 R = 0.238 SNRF2 + 3.290 

19 R = 0.224 snRF2 — 0.272G + 3.339 

20 R = 0.197 SNRF2 — 0.2490, + 4.050 

21 R = 0.195 sNRF2 — 0.140G — 0.2190, + 3.983 
22 R = 0.307 SNRF; + 9.836 

23 R = 0.286 sNRF; — 0.386G + 9.499 

24 R = 0.244 sNRF; — 0.3230, + 9.461 


25 R = 0.243 SNRF; — 0.194G — 0.2780, + 9.318 
26 R = —6.514 D,; + 7.613 


27 R = —6.035 D,; — 0.474G + 7.345 

28 R = —4.994 D; — 0.38802 + 7.737 

29 R = —4.977 D; — 0.225G — 0.3340, + 7.592 
30 R = —0.519 Dz + 9.154 

31 R = —0.482 Dz — 0.423G + 8.788 

32 R = —0.404 Dz — 0.30604 + 8.797 

33 R = —0.405 Dy — 0.262G — 0.2410, + 8.647 
34 R = 0.436 spr; — 1.268 

35 R = 0.407 spr; — 0.442G — 0.923 

36 R = 0.348 spR; — 0.281, + 0.518 

37 R = 0.353 spR, — 0.304G — 0.2030, + 0.254 
38 R = 0.947 sDR2 — 0.249 

39 R = 0.893 spR2 — 0.513G — 0.028 

40 R = 0.766 spR2 — 0.30604 + 1.331 

41 R = 0.785 spR2 — 0.353G — 0.2120, + 0.998 


Correlation 


0.667 
0.873 
0.881 
0.875 
0.883 
0.911 
0.935 
0.923 


0.938 
0.913 


0.931 
0.923 
0.933 
0.878 
0.891 
0.909 
0.912 
0.887 
0.899 
0.914 
0.916 
0.855 
0.879 
0.905 
0.909 
0.797 
0.837 
0.878 
0.885 
0.826 
0.856 
0.868 
0.878 
0.850 
0.883 
0.885 
0.897 
0.874 
0.917 
0.918 
0.934 


rms error 


1.202 
0.787 
0.765 
0.780 
0.756 
0.665 
0.571 
0.621 


0.561 
0.656 


0.590 
0.622 
0.581 
0.773 
0.731 
0.673 
0.662 
0.744 
0.708 
0.656 
0.647 
0.838 
0.770 
0.688 


0.671 
0.975 


0.883 
0.772 
0.751 
0.910 
0.834 
0.800 
0.773 
0.849 
0.757 
0.751 
0.712 
0.784 
0.642 
0.641 
0.578 
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Among the objective performance measures taken singly, the best 
one turns out to be SNRmax, With a correlation coefficient of 0.913 and 
an rms error of 0.656 [formula 10 in Table II]. The compensated signal- 
to-noise ratio SNR.om gives almost the same results, while all the other 
measures achieve a correlation lower than 0.9. In particular, the 
conventional, long-term, signal-to-noise ratio has a correlation of only 
0.667 and an rms prediction error almost double that of SNRmax. The 
log likelihood ratio D; is the second-worst predictor when used singly, 
with a correlation of only 0.797. 

When the two measures of level mismatching, i.e., the average gain 
and the gain fluctuation, are included in the quality prediction for- 
mulas, the prediction accuracy is significantly improved, the rms 
prediction error having a 16-percent decrease on the average. The 
smallest improvement is displayed by SNRzgeg. Among all the other 
measures, SNRcom gives the best prediction when combined with G and 
og (formula 9), with a correlation of 0.938 and an rms error of 0.561, 
about one-quarter of a category. Formulas 13 and 41, which use SNRmax 
and SDR2, are almost as good as formula 9. The frequency-weighted 
sNnRs also give a fairly good prediction, with correlations over 0.9 and 
the remaining frequency domain measures, D;, Dp, and SDR give a 
slightly poorer prediction. 


Vill. DISCUSSION 


8.1 Effects of coder design parameters 


The subjective data have displayed complicated interactions among 
all the experimental design variables, the strongest interaction being 
the one between the adaptation constant a and the decay constant £. 
In fact, each of these two parameters affects different phenomena: 


(t) The dynamic range is reduced when £ decreases from unity, 
but this reduction does not seem to be perceptible for any £ if 
a = 0.75. If a = 0.96875 and 8 = 255/256, a certain reduction in 
the dynamic range begins to be perceived, producing a loss in 
quality at the low level of about 1.5 points with respect to the 
high level. When a = 0.96875 and 8 = 63/64, the dynamic range 
is reduced still further, and the loss in quality of the low level 
with respect to the high one is very large, the average score 
dropping down from 5.3 to 2.4. 

(it) The effect of the errors is smaller when a increases or B 
decreases. For instance, with a = 0.75, the loss in quality 
passing from P(e) = 0 to P(e) = 1/256 averages 0.49 when B 
= 63/64, while it averages 1.59 for 8B = 1 and 0.93 for B = 255/ 
256. - 
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(ut) The difference in the effect of the errors between the two input 
levels is higher for faster adaptation. For instance, passing 
again from P(e) = 0 to P(e) = 1/256, the difference between 
the losses at the two levels averages 1.0 when a = 0.75, while 
it averages 0.45 when a = 0.96875. 


8.2 Optimum coders 


Given a fixed number of bits per sample, a combination of decay 
constant 8 and adaptation constant a provides the best output quality 
for a given probability of error. 

In the case of error-free transmission, optimum quality should be 
attained with no error dissipation mechanism, i.e., 8 = 1 which pro- 
duces theoretically infinite dynamic range. The parameter a is not 
very critical in that case.° 

When the coder operates under noisy conditions and the probability 
of error is in the range of the values encountered in a normal telephone 
connection or even higher than that (as is the case of P(e) = 1/256), a 
very slow error dissipation associated with fast adaptation provides 
good robustness to channel errors, without impairing the dynamic 
range. Actually, in this experiment, the combination 8 = 255/256 and 
a = 0.75 provided optimum performance even under error-free condi- 
tions. 

If the probability of error is as great as 1/32, more typical of mobile 
radio communications, the best compromise between dynamic range 
and error sensitivity is obtained by a slow adaptation constant, com- 
bined again with a slow error dissipation rate. However the use of 
faster error dissipation and faster adaptation could be almost as good 
for this very high error rate. 


8.3 Objective measures 


One aim of the experiment was to examine a certain number of 
objective measures of coder performance and to compare them in the 
light of the actual subjective quality ratings, obtained under very 
different conditons. With the results of correlation and regression 
analyses reported in Table II, it is possible to observe strengths and 
weaknesses of the different measures and to derive general indications 
on which are the desirable properties of an objective quality measure. 

A first indication that emerges from the experimental data is that 
the conventional long-term SNR is a very poor indicator of the quality 
of ADPCM coders under noisy channel conditions; this confirms the 
results obtained with pcm'® and ADPCM coders’ in the case of error-free 
transmission. Therefore, the use of SNR can be completely misleading, 
when comparing different coders operating under noisy conditions. A 
noticeable improvement in prediction accuracy is obtained simply by 
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measuring signal-to-noise ratio segmentally. Being time-segmental is 
a necessary property of any successful objective quality measure of 
coded speech. 

Table II demonstrates also that, when a coder incorporating an 
adaptive quantizer is operating under noisy channel conditions, an 
objective performance measure must not be sensitive to changes or 
fluctuations of the output speech level. These fluctuations can be 
measured separately and the value obtained can be combined with the 
performance measure to improve the accuracy of the subjective quality 
prediction. It should be noticed, for instance, that G and og have a 
positive coefficient when combined with SNRgeg (formula 5 in Table ID). 
This indicates that the level mismatching is weighted too much in 
SNRseg, Which did not incorporate any level compensation. 

An important consideration on the subject of objective quality 
measures is that results from recent experiments” indicate that fre- 
quency-weighted signal-to-noise ratios improve the prediction accu- 
racy, especially when largely different noise spectra are produced by 
the coders.’ In this experiment, actually the frequency-weighted SNRs 
did not predict the subjective ratings as accurately as the simpler level- 
compensated SNRs, namely, SNRcom and SNRmax. This may depend on 
the fact that the level compensation is effected by minimizing the rms 
error on the whole bandwidth; this fact may worsen the measure in 
same band. If this is the case, it would be a weakness of the frequency- 
weighted SNR’s when measuring coder performance in the presence of 
channel errors. 

A final remark on frequency domain measures: These measures are 
more general than time domain ones because they are insensitive to 
short delays or to phase distortion.”'* Therefore, they are more easily 
applicable to the test of coders whose input and output signals are in 
analog form or which include digital filters. On the other hand, the 
performance measures used here incorporate a spectral noise weighting 
that cannot be directly controlled. However, it is encouraging to see 
that the newly defined spectral signal-to-distortion ratio, SDR2, provides 
a very good prediction of subjective ratings when combined with the 
level mismatching measures. 

More work is needed in the field of objective prediction of coder 
quality. In particular, frequency weighted SNRs should be evaluated 
more carefully, to derive the appropriate frequency weighting mecha- 
nism. In addition, the difference in behavior of the various spectral 
distortion measures need to be analyzed in more depth. 


8.3.1 Estimation of the subjective effect of level mismatching 


8.3.1.1 A Physical Interpretation of Prediction Formulas. The results 
of Table II lend themselves to a nice interpretation. The quality of an 
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adaptive coder operating in a noisy channel environment (high prob- 
ability of error) may be considered as composed of two terms: (t) 
“intrinsic” goodness of the speech reproduction, which takes into 
account the noise due to the coding and to the errors, but not the level 
mismatching, (iz) loss in quality due to the level variations caused by 
the sensitivity of the adaptation algorithm to channel errors. In for- 
mulas, we can write 


R=R,-— R,z. (28) 


R,; can basically be estimated by any of the performance measures 
which incorporate gain compensation or, in any case, which are not 
sensitive to alterations in the output signal level. Rz is instead esti- 
mated by a linear combination of the gain fluctuation and the average 
gain, modified according to eq. (27). 
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Fig. 7—Predicted overall rating R and “intrinsic” goodness Rz, as a function of 
probability of error, for 4-bit ADPCM and different combinations of design parameters: 
(a) a = 0.75, B = 1. (b) a = 0.75, B = 255/256. (c) a = 0.96875, B = 255/256, 
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8.3.1.2 An Example. In the light of the interpretation given in the 
previous section, it is possible to give at least a qualitative answer to 
the question on which is the subjective effect of level mismatching. 
Figure 7 shows the predicted overall rating R and the “intrinsic” 
goodness R; as a function of the probability of error, for the 4-bit 
ADPCM coder with low level input and three different combinations of 
design parameters. R was computed according to formula 41 in Table 
II, while R; was computed discarding the terms involving G and o, 
from the same formula: 


R, = 0.785 SDR2 + 0.998. 


In the case of fast adaptation and absence of error recovery (Fig. 
7a), the loss in quality due to level mismatching can be estimated as 
half category (1 point) for the intermediate error rate and 1 category 
(2 points) for the high error rate. In the case of fast adaptation and 
slow error recovery (Fig. 7b), the loss Rz is instead reduced to about 
only half category (1 point) for the high error rate. 

Finally, when slow adaptation and slow error recovery are used, 
simultaneously (Fig. 7c), the loss due to level mismatching can be 
estimated as only about 0.4 point, i.e., less than a quarter of a category. 
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This paper introduces a nonstationary model for images and 
develops an adaptive intrafield ppcm codec based upon the model. 
The codec attempts to minimize the mean-square coding error at 
each sample point in the picture. The quantizer in the resulting 
adaptive codec is found to be similar to that previously obtained from 
visual masking considerations. Comparative simulation results using 
256 X 256 pixel rasters are given for two- and three- bit/pixel versions 
of the adaptive codec, the three-bit/pixel Graham codec, and three- 
bit/pixel previous element DPCM. 


1. INTRODUCTION 


This paper introduces a nonstationary model for images and devel- 
ops an adaptive intrafield codec based upon the model. The codec 
adaptively estimates both the mean and probable range of values of 
the next picture sample to be encoded and adapts the predictor and 
quantizer accordingly. In so doing, the coder attempts to minimize the 
mean-square coding error (MMSE) at each sample point in the picture. 
The MMSE distortion measure is generally acknowledged to be a poor 
indicator of image quality.’ However, when it is applied on a point 
(rather than area) basis in conjunction with the image model presented 
here, the coder adaptation and resulting coding quality are found to be 
comparable to that previously obtained from visual masking consid- 
erations. This result follows from a property of human vision, stressed 
by Graham,” concerning the strong connection that exists between 
image chaos (unpredictability) and the visual system’s tolerance to 
noise-like coding distortion. Because of this property, we obtain good 
image quality at two bits per pel and excellent quality at three bits per 
pel in a DPcM codec designed solely using the MMSE criterion—masking 
phenomena are in large part accounted for automatically when the 
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source model more adequately represents actual images and when the 
distortion criterion is applied on a point basis. 


ll. SOURCE MODEL 


This section introduces a nonstationary causal source model for the 
intrafield video process that will be used to develop the adaptive 
predictive intrafield codec of Section IV. Motivation for the model is 
intuitive and follows from an examination of a representative video 
signal of the type we wish to encode, such as that shown in Fig. 1. This 
is a frame of two interlaced fields, each having 256 pixels per line and 
128 lines, with amplitudes stored as 8-bit quantities. The essential 
characteristic of this (and any) image is that it is a projective transfor- 
mation of a collection of physical objects. As a consequence, the image 
is partitioned into regions of luminance elements whose amplitudes 
are interrelated by the physical structure of the objects they represent. 
The result is an array of pixels composed of distinct regions having 
slowly varying “brightness” and “texture” with abrupt boundaries (the 
picture outline) separating one region from another. We find it natural 
to view this array as a field that is partitioned into regions of inde- 
pendent quasi-stationary subfields. Two underlying random phenom- 
ena are involved: the random amplitudes of picture elements within a 





Fig. 1—Checker girl original. 
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given subfield and the random selection of the subfield with respect to 
raster coordinates. A source model that incorporates both phenomena 
is shown in Fig. 2. 

Figure 2 models the image generation process as a composite of Q 
autoregressive sources, g = 1, 2, ---, Q, and one white source, g = 0. 
Switches S1 and S2 determine which source generates output lumi- 
Nance Smn = S;, where m and n are, respectively, the line number in the 
field and the column number of the pixel and ¢ is the time the pixel is 
encountered during conventional line scanning. The autoregressive 
sources, characterized by predictors 1 through Q and “innovations” 
process Wmn = W,, provide a set of @ possible processes from which the 
regions of slowly varying brightness and texture of a subfield in an 
actual image can be approximated. The random variables uw, are 
assumed zero mean, independent, and characterized by a single known 
probability density function (pdf) g(w). The predictors F, in sources 
q= 1,2, ---, Q are taken to be linear functions of pixels from the local 
past neighborhood of coordinate (m, n) = t. Section IV discusses the 
specific predictors chosen for the codec of this paper (Table I). Source 
0 models those pixels of an actual image that either have no structural 
relation to previous pixels or whose relation to these pixels is not 
adequately modeled by sources 1 through Q. Such pixels tend to occur 
in highly chaotic regions of the image and at certain boundaries at 
which new subfields are initiated. Since this source represents the 
extreme of chaos possible in an image, its output is taken to be a 


RANDOM Fo 
SOURCE 
51 5, 
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NO. 1 
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NO. 2 


PREDICTOR 
NO. 0 






Fig. 2—Source model. Switches S1 and S2 are governed by eq. (2) in the text. 
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sequence of random variables Fo = Fo(m, n), each uniformly distributed 
over [0, 255]. 

Switches S1 and S2 of the model determine which source is used for 
final output at each raster coordinate and thus determine image outline 
as well as more subtle structural changes. “Outline” and “subtle 
structural changes” are subjectively perceived qualities of an image 
that are difficult to quantify probabilistically. However, in an actual 
image “structure” tends to vary slowly: boundaries between regions 
are an exception, but even here the discontinuity is generally only 
along one dimension. This suggests that probabilistic information 
regarding the source in operation at time ¢ can be inferred by appro- 
priate processing of the pixels in the local past vicinity (in the same 
field) of the pixel in question. To arrive at a source model that 
characterizes this quasistationary in the simplest way, we model the 
image source as choosing sources, g = 0, 1, 2, ---, Q, independently 
according to unknown first-order probabilities P[g; (m, n)] that are 
slowly varying functions of coordinates (m, n). We further assume that 
the @ textures generated by sources g = 1, 2, ---, Q are a priori 
equally likely for a random choice of coordinate (m, n): 


E{P(q; (m,n)}} =e q=1,2,---,Q (1a) 
and 
E{Plq; (m,n)]} =e<ec q=0, (1b) 


where c and € are constants satisfying « + Qc = 1 and the expectation 
is taken over the raster coordinates. 

An alternative approach would be to model the sequence of gmn as 
stationary Markov. However, this approach was not taken since the 
assumption of nonstationary independent q leads to a relatively simple 
codec that is robust with respect to both varied picture inputs and 
channel errors. The assumption of equality of expectations in (1b) 
leads to mini-max performance with respect to variations in textural 
content of the picture to be encoded. Biasing this a priori distribution 
toward one predictor would make the codec more susceptible to poor 
performance on a picture which does not match this distribution. The 
problem faced by the codec is to estimate probabilities P[q; (m, n)] by 
suitable processing of past pixel outputs and use the estimates to best 
advantage for bandwidth compression. 

To summarize, the proposed model of the video process has the 
form (Fig. 2) 


«, = |Fe with probability PC; 2) 
‘| F, + wz, with probability P(g; t), 1<q = Q, (2) 


where Fp is an independent random variable uniform over [0, 255], Fg 
is a given linear function of pixels in the local past neighborhood of s; 
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(Table I), and w; is an independent zero mean random variable 
characterized by probability density function (pdf) g(w). Probabilities 
p(q; t) gq = 0, 1, ---, Q@ vary slowly with respect to at least one 
coordinate of the raster and satisfy (1); otherwise, these probabilities 
are unknown. Note that the model embeds the elusive variety of gross 
image structure in the unknown probabilities P(q; t), 0 = g = Q. These 
represent the probabilistic information that the encoder hopes to learn 
by suitable processing of past image source outputs. 

Figure 3 illustrates a representative output generated by the model. 
In obtaining this output, g(w) was assumed Laplacian, and the P(q; t) 
were estimated from the image of Fig. 1 by a procedure described in 
Section III. Significant increases in structural similarity to Fig. 1 are 
possible by modeling the sequence of uw; as nonstationary. In the 
interest of codec simplicity, however, this additional complexity is not 
included in the model. 


ltl. ANALYTICAL DEVELOPMENTS 


The design of the ppc codec of Section IV requires specification of 
both the quantizer and the predictor. Complete statistical information 
pertinent to this design is contained in the conditional pdf of s,; given 





Fig. 3—Representative output of source model. Output of the source model of Fig. 2, 
where w, is Laplacian and probabilities P(q; t) of eq. (2) are estimated from Fig. 1. 
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the set of past pixels {s,, t< t} = S;. This section describes how this 
conditional pdf can be etimated at the source output. 

It can be easily shown that, for given P(q; t), g = 0, 1, ---, Q, the 
probability density of s; conditioned on S; is (for 0 = s; S 255) 


PO;t) , & em 
355 2, Ho — FulS) PUG: 8, (3) 





P(s:| Sr) = 


where F,(S;) denotes the gth predictor F, of s; as an explicit function 
of past pixels S;. An estimate of density function (3) is obtained by 
replacing P(q; ¢) in the above by its estimate, as described below. 

Let the number of times the gth source had been output in a local 
past region R,; of N points neighboring (m, n) = ¢ (Fig. 4) be denoted 
by n(q). Due to the nature of the source model, n(q) cannot be 
measured at the source output. However, a reasonable and computable 
approximation to it is given by the expectation E{n(q)|S;}, where the 
expectation assumes a random selection of (m, n) and is over the 
density (w;). By the quasi-stationarity of P(q; t), we then set 


57.  LAng)| Sr} 
P(q; t) = N ; (4) 
which becomes (appendix) 
P i eas 
P(q; t) = NV » P(g = @| Sr), (5) 
j=l 


where P(g; = q|S7) is the conditional probability that the jth pixel in 
R, (Fig. 4) was output by source g based upon a priori probabilities 
E({P{q; (m, n)]} of (1). Further manipulations (appendix) give 


oO Oo .0 0 
Or “QO: 0 





Or Ob. +O> (Oo -@ 
Or Oe Oe Os: 3 
On. OO Or - 0 


~ 
~ PIXEL t = (m,n) 


Fig. 4—Illustration of region R,. This region consists of N pixels in a local past vicinity 
of pixel (m, n). Note that the numbering of coordinates j = 1, 2, ---, N is arbitrary, as 
are the region boundaries. 
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: € 
P(g; t) = Ko q=0 (6a) 
7 KC = : 
P(q; t) = W ¥ gle(és)) 1sqeQ, (6b) 
j=l 
where K satisfies 
Q 
> P(g; t) =1. (7) 


I 


q=0 


Equation (5) interprets P(g; t) as an arithmetic average of a posteriori 
probabilities of g over region P;, and eq. (6) show how this average can 
be computed. The term e,(¢; 7) in (6b) is the difference between the 
actual value of the jth pixel in region R, and the predicted value of 
this pixel given by predictor F’, and is therefore the implied value of 
the 7th innovations variable in R; under the hypothesis that predictor 
q was in operation at the source. Explicitly, 


e,(t; 7) = si — F,(S}), (8) 


where s/ is the jth pixel in R, and S} is the set of pixels previous to 
s4. Equation (6b) estimates P(q; t), by summing the relative probabil- 
ities of the innovations implied under the hypothesis that source q was 
in operation over region R,. Note that if g(-) has its peak at zero, then 
P(q; ), 1 <q = Q, will be large for those g corresponding to small 
prediction error e,(¢, 7) over the N point region. If none of the Q 
predictors is consistent with past local data, then all terms in the sum 
of (5b) will be small for 1 < g S Q, and the normalization in (6) will 
make P(0; ¢) large. Further description of (4) to (7) is included in the 
derivation in the appendix. 

The codec described in Section IV predicts s; by the estimated mean 
of predictable source outputs: 


F(S7)P(q; t) 
1 


= aoe (9) 
> P(q; t) 
q=1 


IMea 


An important characteristic of this prediction rule is its insensitivity 
to small variations in data S; regardless of the relative values of N 
and Q. This is in contrast to the covariance method in linear prediction 
described in a review paper by Makhoul? in which small sample size 
can lead to an ill-conditioned system of equations whose inversion is 
the adapted predictor. Since (9) is a weighted average of stable (and 
generally good) estimates F,, stability persists even for N < Q, and 
some thought indicates that the resulting prediction of s, works in an 
intuitively reasonable way even if N is only unity. 
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IV. THE CODEC 


In this section, a codec resulting from the source model is described. 
A block diagram of the encoder is shown in Fig. 5. The codec has been 
used to code pictures using two and three bits per pel. 

The encoder operates by forming Q estimates F,(X;), 1 = q = Q, of 
source output S, based upon the previously reconstructed field ele- 
ments X;. Estimates of source probabilities P(q; t), 0 = q = 1, are 
made with eq. (6) to (7) using previously reconstructed pixels X4 in 
place of S?. Estimates F,(X,) and probabilities P(q; t) are used to 
predict the next encoder input pixel s;, according to (9) and the most 
likely distribution of values s; according to (3), with X; replacing S; . 

The encoder has been implemented using an N = 4 point learning 
region R, (Fig. 6) and Q = 6 predictors. The predictors used are given 
in Table I. 

Note that with these six predictors the form of predictor (7) can be 
any one of the most common fixed predictors used in intrafield coders. 
This varies over the picture so that the best predictor (or best weighted 
sum) considering the recent past will be used at each sample point. 

The pictures which were encoded consisted of 256 lines in two 
interleaved fields and 256 samples per line. The previous line elements 
were taken from the previous line in the same field. In this environ- 
ment, no advantage was obtained by including elements more than 
one line away in the estimates. Similarly, no visible improvement was 
obtained using elements that were more than two elements away on 
the same line. The slope estimator, F’;, and the planer estimator, Fe, 
were found to be particularly useful in the system which uses two bits 
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Fig. 5—The encoder. 
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Fig. 6—Four-point region R, used by codec. 


Table | 


F\(n, m) = x(n — 1, m). 

F.(n, m) = x(n — 1, m — 1). 

F3(n, m) = x(n, m — 1). 

Fi(n, m) = x(2 + 1, m — 1). 

F;(n, m) = 2X(n — 1, m) — x(n — 2, m). 

Fe(n, m) = x(n — 1, m) + X(n, m — 1) — X(n -— 1, m— 1). 


per pel. These estimators allowed the coder to respond more quickly 
to edges within the picture, and reduced slope overload. 

Ideally, the quantizer should be adapted at each point to the esti- 
mated probability distribution of s;. In view of the complex form of (3), 
this type of redesign is not feasible, and the following ad-hoc curve- 
fitting technique was used to simplify the adaptation algorithm. The 
density function g(w) was taken as Laplacian, g(w) = a/2 exp(—a|w|). 
The Max quantizer’ for this was determined. Each side of the distri- 
bution (3) about the mean §s; was then approximated by an exponential 
distribution, and the axis was simply scaled appropriately in codec 
operation to place the quantization levels. The parameter of each 
exponential distribution was selected so that it had the same first 
moment about S; as the corresponding portion of the actual distribution 
as described in Fig. 7. 

When the estimated probability of occurrence of the random esti- 
mator is near zero and the estimators g = 1 through 6 are identical 
corresponding to a perfectly flat region in the picture, the parameter 
of the exponential defining the quantizer assumes its smallest value. 
In this situation, the parameter of the exponential defining the quan- 
tizer is approximately a, the parameter of the Laplacian distribution 
defining the innovation term in the model. Therefore, a determines 
the minimum values of the levels of the quantizer, and these, in turn, 
determine the amount of granularity due to quantization noise in flat 
regions of the picture and the ability of the coder to respond to 
unexpected edges. The smaller the value of a, the lower the granular 
quantization noise; the larger the value of a, the quicker the coder can 
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Fig. 7—Illustration of encoder’s derivation of predictor and quantizer. Fo refers to the 
distribution of the white source output; s, is the weighted sum predictor; m, and m. are 
the upper and lower first moments of the actual distribution about s,; and c, and c, are 
the exponential distributions used to determine the quantizer. 


respond to edges. Because of this interaction, the values of a were 
selected experimentally based upon visual examination of a sequence 
of coded pictures for the two- and three-bit/pel quantizers. For the 
two-bit/pel quantizer, a was selected so that the minimum value of 
the inner quantizer level is equal to two picture levels, when the picture 
is initially quantized into 256 levels. For the three-bit/pel quantizer, 
the inner quantization level was selected so that the inner quantization 
level is equal to one picture level. 

The random variable Fp in the source model of Fig. 2 is uniformly 
distributed over the range of possible values the sample can assume. 
In implementing the codec, it was found to be desirable to assume that 
the range of Fo is somewhat reduced. Limiting the span of Fo is 
particularly necessary in the system which transmits two bits per pel. 
This can be seen as follows. Assume that probability P(0, t) is estimated 
by the encoder to be close to unity. In this situation, if Fo has range 
[0, 255] the four quantization levels will be spread over the entire range 
of possible sample values. It is then likely that none of the estimators 
will be close to the reconstructed value x; even though an estimator 
can have closely approximated the actual value s;. Thus, the random 
estimator may be used for the next sample. This creates an instability 
in the coder which can propagate into flat regions of the picture. To 
eliminate this type of instability, the maximum range of the quantizer 
was limited. To be consistent with limiting the maximum range of the 
quantizer, the span of Fo was limited to a symmetrical region about §; 
of (9). The maximum span of the quantizer was also set experimentally. 
In the two-bit/pel system, the maximum span of the quantizer was set 
so that the inner level of the quantizer is eight picture levels. And in 
the three-bit/pel system, the maximum span of the quantizer was set 
so that the inner level in the quantizer is four picture levels. In the 
two-bit/pel system, there are only two quantization levels on each side 
of the predicted value. In this system, the maximum span of the 
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quantizer determined the ability of the encoder to track sudden 
changes in the picture. Therefore, it is necessary to make the maximum 
quantizer span as large as possible, without making the encoder 
unstable. In the three-bit/pel system, four quantization levels are on 
each side of the predictor. In this system, restricting the maximum 
quantizer span was necessary to prevent the quantizer span from 
frequently exceeding the range of possible picture levels and wasting 
quantization levels. This is why a smaller maximum value of the inner 
quantization level was selected for the three-bit/pel system than for 
the two-bit/pel system. 

In Figs. 8 and 9, the quantization span for various parts of the 
picture in the two- and three-bit/pel systems is shown. In these 
pictures, the average of the upper and lower quantization spans is 
displayed. The white areas correspond to the smallest span of the 
quantizer and the black levels the largest span. It is interesting to note 
that the resulting quantizer adaptation is similar to that which would 
be expected if a masking function were used.” However, this quantizer 
adaptation was arrived at strictly by mathematical techniques, mini- 
mizing the expected point mean-squared error with a varying proba- 
bility distribution of next sample values, rather than by the psychovi- 
sual considerations used to derive masking functions. This result is 





Fig. 8—Quantizer range adaptation of the two-bit/pel codec. 
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Fig. 9—Quantizer range adaptation of the three-bit/pel codec. 


consistent with Graham’s early observations concerning the strong 
connection between image chaos and the visual system’s tolerance to 
noise-like distortion.” 


V. RESULTS 


This adaptive predictor with a two- and three-bit/pel quantizer has 
been implemented and compared with an adaptive predictor using 
Graham’s rule? and the three-bit/pel fixed quantizer suggested in the 
Graham paper, and a previous element DPCM encoder with a fixed 
three-bit/pel quantizer. Our two-bit/pel adaptive predictor has consid- 
erably less slope overload than the previous element predictor having 
three bit/pel quantizer, but is not quite as good as the Graham 
predictor having a three bit/pel quantizer. Our adaptive predictor with 
a three bit/pel quantizer has less slope overload than the Graham 
predictor with a three bit/pel quantizer. In addition, the estimates at 
edges within the picture are accurate enough to virtually eliminate the 
edge business in moving sequences which is characteristic of many 
adaptive predictors. To demonstrate these characteristics, the differ- 
ence between the original picture of the checker girl, Fig. 1, and the 
result of processing by these four techniques is shown in Figs. 10 to 13. 
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Fig. 10—Two-bit/pel codec performance. Top: Decoder output. Bottom: Difference 
between decoder output and original. 
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Fig. 11—Three-bit/pel codec performance. Top: Decoder output. Bottom: Difference 
between decoder output and original. 
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Fig. 12—Performance of three-bit Graham codec. Top: Decoder output. Bottom: 
Difference between decoder output and original. 
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Fig. 13—Performance of three-bit previous element Dpcm. Top: Decoder output. 
Bottom: Difference between decoder output and original. 
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APPENDIX 


This appendix traces the development from eq. (4) to (7) of Section 
III. 

There are M = (Q + 1)” possible vectors V = (qi, 92, --+ Gj *-@N) 
of source options in the N point region R, of Fig. 4. Number these 
vectors 2 = 1, 2, ---, M and let V; denote the ith vector. Define n(q | V;) 
as the number of components in V; that equal the specific value gq. 
Then (4) becomes 





Pas) = Po ) 
Lee = 
a 2 n(q| V)P(Vi| Sz). (10b) 


In taking the expectation in (10a) we have treated t = (m, n) asa 
randomly chosen raster point for which P(q) = E{P(q; t)} of eq. (1) 
applies. Because the qg are selected independently, P(V;|Sz) of (10b) 
is related to P(q) by 


F P(S;|Vi) =~ 

P(V;| S7) - So Pw) (11a) 
_ P(ST|Vi) * ~ E 
=—pey Ll P(qi/), (11b) 


where p(S; | (-)) denotes the probability density function of the vector 
of values in S;, and qj is the jth component of V;. [The random 
selection of ¢ cannot affect the independence of the components of V; 
in (11a); hence, (11b)]. 

Substituting 


N 
n(q| Vi) = & 8q-a, (12) 
j=! 
into (10b) and summing over 1, (10b) becomes 
z i ae 
P(q; t) = WV YY P@ =@|S-), (13) 
j=l 


where P(q; = q|S7) is the conditional probability that the jth pixel in 
R, was generated by source q. Note that (13) gives P(q; t) as an average 
of a posteriori probabilities of g over the region R;, where 


= P(S; |q;= e 
Pg; = 9\87) =F 2=? Pig) (14) 


We now partition the set of pixels. S; into (7) a subset of pixels future 
to s; but past to S; (call it S}); (ii) the pixel s;; and (iii) a subset of 
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pixels Si past to pixel s;. The elements in S} when conditioned on s; 
and S’ do not depend upon gq;, and it follows after straightforward 
manipulations that 


P(g = 9|S7) = K als — FA(S))P@; 1sq==Q, 


and 
4 : 255 ; 


where K is a normalizing constant. Substitution of eqs. (15) and (1) 
into (10) yields (5). 
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Reduction of Transmission Error Propagation 
in Adaptively Predicted, DPCM Encoded 
Pictures 


By N. F. MAXEMCHUK and J. A. STULLER 


(Manuscript received November 21, 1978) 


A new technique for reducing transmission error propagation in 
adaptively predicted, DPCM-encoded pictures is described. The basis 
for the technique is a generalization of the notion of predictor output 
attenuation, described by Graham, to include attenuation of the 
adaptive prediction function. Simulation results are presented that 
show that application of the technique to Graham’s codec results in 
significant reduction in error propagation without degradation of 
picture quality. The technique requires no increase in transmission 
rate. 


I. INTRODUCTION 


This paper presents a new and simple technique to reduce error 
propagation in DPCM image coders that employ adaptive switching- 
type prediction. An analytical performance description of this tech- 
nique has not been obtained. However, simulation results using the 
Graham’ adaptation algorithm are presented that demonstrate that— 
in this case, at least—the technique can provide substantial reduction 
in channel error propagation without decreasing the transmission rate. 

The class of coders considered is those which adaptively choose one 
of Q fixed predictors Fy, gq = 1, 2, ---, Q, according to a decision rule 
that operates on the previously reconstructed pixels in the local past 
vicinity of the element to be predicted. If x;; is the ith pixel on the jth 
line of the input raster, and y,; is the vector of reconstructed pixels in 
the local past vicinity of (z, 7), then the adaptive predictor has the form 


xy = F(yu), (1) 


where F(-) is one of @ fixed functions F,(-), gq = 1, -++, Q, with qg 
chosen according to a decision rule operating on yy, 
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q = D(yy/). (2) 


The encoder and decoder use the same decision rule to determine g, 
but the decoder must base its decision upon its possibly contaminated 
version of the reconstructed past scene. 

An example of (1) and (2) is given by Graham’s predictive encoder. 
Here Q = 2 with 


Fi (yy) = yi — 1,7) 
F(yy) = y(t, 7 — 1) (3) 


and 


1: it | yey og =A) 
2; otherwise. 


It is well known that, for a fixed transmission rate, adaptive prediction 
generally results in a more accurate coded version of the image, 
particularly on edges within the picture where large changes in ampli- 
tude occur along one dimension. However, a serious problem generally 
arising from such adaptation is the response of the system to channel 
errors. Generally, the effect of an error propagates over a larger area 
of the picture when an adaptive predictor is used than when a fixed 
predictor is used. This occurs because transmission errors not only (2) 
contaminate the value of the elements used by the receiver in the 
function F(-) when the receiver’s choice of g is correct, but can also 
(ii) cause an error in the receiver’s choice of g. Note that effect (7) is 
present in nonadaptive coders and is defined as occurring in adaptive 
coders when the correct choice of g is made by the decoder. Effect (iz) 
is unique to adaptive prediction and is potentially more grievous since 
the transmitter and receiver then use different choices for the predic- 
tion function F(-)—a result that, once started, can propagate. An 
example of the effect of transmission errors in adaptive DPCM is shown 
in Fig. 1. In this example, Graham’s three-bit codec is used over a 
binary symmetric channel having bit error probability of 10~*. Figure 
2 shows the difference between the output of this system with and 
without transmission errors. 


ll. PREDICTOR OUTPUT LEAK 


As observed by Graham and others, the effect of transmission errors 
can be reduced by attenuating the predictor output by a constant a, 0 
<= a <1. In general, a bias term can be introduced so that x;,; assumes 
the form: 


Xi = AF (yi) + (1 — @)n, (5) 
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Fig. 1—Effect of 10~* channel bit error probability on output picture: Graham 3 bit/ 
pel codec. 


where 7 is a constant in the span of possible picture values. A possible 
choice of 7 is the mean of x,, 


n = E{xy}. (6) 


Other choices, however, can give subjectively better results depending 
upon context and system nonlinearities. 

Equation (5) can be viewed as a weighted combination of locally 
inferred and globally given knowledge about x,. A value of a < 1 has 
the effect of decreasing the memory of the closed reconstruction loop, 
and the bias term causes the output to tend toward 7. The quantity 
quantized and transmitted has the form 


xy — Xi = alxy — F(yy)] + (1 — a) [xy — 9), (7) 


which is seen to consist of both DPcM and PcM information. As a varies 
from one to zero, the system changes from DPcM to PcM. Therefore, 
attenuation of predictor output as in (5) trades error-propagation 
attenuation with transmission rate. It should be observed that the 
technique described by (5) will also reduce error propagation in non- 
adaptive codecs. Also, the technique does not directly address the 
problem of the choice of g and is therefore a remedy more closely 
connected to type (z) errors than to type (ii). 
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Fig. 2—Difference between output of Graham codec with and without transmission 
errors. 


Using the Graham system, it was found that a could be reduced to 
“As before degradation in the output caused by quantization noise 
becomes visible. The reduction in error propagation resulting from (5) 
(with 7 set to 128) is shown in Fig. 3. The difference of the coded 
picture with and without channel errors is shown in Fig. 4. Although 
an improvement is obtained with this approach, the next section shows 
that substantially greater improvement is possible. 


Ii. PREDICTION FUNCTION LEAK 


Since the second effect of channel errors in an adaptive codec is to 
make the value of g uncertain, the receiver loop must in fact estimate 
the function F(-) as well as x,. In analogy with (5) we introduce a 
constant 6, 0 <= B <1, and set (at both transmitter and receiver) 


F(-) = BF(-) + (1 — B)F(-), (8) 


where F(-) is a fixed predictor. A reasonable choice for F(-) is the 
mean of F,, 


Q 
F(.) = ¥ Fi()P@), (9) 
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Fig. 3—Reduction of transmission error propagation using predictor output leak (a 
= !%«, 1 = 128, 10“ bit error probability). 


where P(q) is the a priori probability of g. Other choices for F(-) are 
possible. 

Note that, as B varies from one to zero, a system using F(-) will 
change from fully adaptive DPcM to nonadaptive DPcM. The smaller 
the value of £, the closer the predictor to being fixed, and the smaller 
the effect of error propagation due to using the wrong predictor. Note 
also that (9) is an approach that is applicable only to adaptive codecs. 
Because of this, we view this technique as a remedy for the second 
error class (11) described in Section I. 

The concept of prediction function leak has been applied to the 
Graham predictor and has successfully reduced error propagation. The 
predictor used in this experiment is: 


ky = F(yy), (10) 
where F‘(-) is given by (8) using (3) to (4) and 
F( yy) = 2(y(i — 1,7) + y(t, 7 — 1). (11) 


It was found experimentally that 8 could be reduced to be between % 
and % (depending upon the picture) before the ability of the adaptive 
predictor to respond to edges within the picture was compromised. 
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Fig. 4—Difference between output of codec with and without transmission errors (a 
= '%6, 7 = 128). 





Fig. 5—Reduction of transmission error 
prediction function leak (a = '%6, B = %, 7 = 128, 10 * bit error probability). 
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Fig. 6—Difference between output of codec with and without transmission errors (w 
Mc, B = %4s, 1 = 128). 





Fig. 7—Reduction of transmission error propagation using predictor output and 
prediction function leak (a = %s, B = 4, 7 = 128, 10“ bit error probability). 
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Fig. 8—Difference between output of codec with and without transmission errors (a 
= '%6, B = %, n = 128). 
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Fig. 9—Quantizing noise of Graham codec. 
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Fig. 11—Quantizing noise for a = '%s, B = %. 
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Fig. 12—Quantizing noise for a = 6, B = 4. 


When prediction function leak is used, it is possible to use predictor 
output leak to further reduce the effects of errors. The estimator is 
formed as: 


Ki = aF (yz) + (1 -— an 
= aBF (yj) + a(1 — B)F (yy) + (1 — @)n. (12) 


Pictures with the same transmission error patterns as those that 
occurred in the pictures produced using the original Graham predictor 
were processed using (12). In Fig. 5, a = 6, 8B = % and y = 128. Figure 
6 is the difference between pictures processed with this predictor with 
and without transmission errors. In Fig. 7, a = 6, B = %, and yn = 128. 
Figure 8 is the error difference picture of this coder. As seen from these 
pictures, leaking the prediction function significantly reduces the effect 
of transmission errors. Analysis of this effect has been hindered by the 
nonlinear nature of the equations and the fact that the quantities 
involved exist on a two-dimensional field. 

Figures 9 through 12 show the encoding (quantizing) noise for each 
of the systems previously described in this paper. By comparing Fig. 
9 with 11 and 12, it can be seen that, for 8 equaling % and %, prediction 
function leak reduces encoding noise along edges within the picture. 
In fact, pictures transmitted with these values of 8 over an error-free 
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channel are preferable to those transmitted with the original Graham 
codec since edge serration, which sometimes occurs when a switching- 
type predictor is used, is reduced. Figure 10 shows that the introduction 
of predictor output leak alone does not produce the same beneficial 
effect. 
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Application of Optimization Theory to the 
Control of the Optical Fiber Drawing Process 


By D. H. SMITHGALL 
(Manuscript received January 15, 1979) 


The optical fiber drawing process is examined and a feedback 
control loop identified. The incremental dynamic response of each 
loop component is determined, and the sensitivity of loop response to 
system parameters is examined. The control loop is optimized, based 
upon a mean square error criterion with constraints imposed for 
periodic disturbances. An expression is derived for the effectiveness 
of the control loop with respect to sources of system disturbance and 
found to correlate well with experimental results. 


|. INTRODUCTION 


With the advent of fiber optics technology has come the potential 
for use of this technology in high quality telecommunications systems. 
Such systems require sources, detectors, and fibers superior to those 
used in the present applications. The fibers in high quality systems 
must have low loss and dispersion and yet be economically produced. 
One factor influencing transmission loss in the fiber, particularly at 
splice locations,’ is the diameter uniformity. Diameter uniformity is 
directly related to the manufacturing process and is influenced by the 
environment in which it is drawn’ as well as the material from which 
it is drawn. In additon, large variations in diameter occur during the 
startup operation, resulting in a material loss of up to 10 percent of the 
potential fiber. 

Much of this wastage can be eliminated and a high degree of fiber 
uniformity maintained by the judicious design and application of a 
feedback control on the fiber drawing process. Optimization of such a 
control requires identification of the distributed, nonlinear drawing 
process and a quantification of the sensitivities of the process to 
changes in process parameters. 


1425 


1.1 The fiber drawing process 


Optical fiber is formed by locally and symmetrically heating a 
cylindrical preform, typically 7 to 25 mm in diameter and 30 and 60 cm 
in length, to a temperature in the neighborhood of 2000°C. As the 
preform of diameter D, is fed into the heat zone at a velocity V,, the 
fiber is drawn from the molten material at a velocity V;, as shown in 
Fig. 1. Due to the temperatures involved and the tolerances required, 
the fiber cannot be drawn through a die, and consequently the surface 
of the molten material is a free boundary whose shape is determined 
by an equilibrium between the velocity shear gradients and the re- 
straining surface tension. The diameter of the fiber, D,, is determined, 
then, by the principle of conservation of mass, which may be written 
as 


t 
D? V;= D2 V, + i dw, (1) 


where w is a random process representing mechanical and thermally- 
induced disturbances as well as the variations in diameter which occur 
while the process is establishing its equilibrium condition. The nature 
of the disturbing influences is illustrated in Fig. 2, where the fiber 
diameter is plotted for a 500-m length of fiber with constant D,; Vp, 
and V;. An additional source of diameter variation results from changes 
in preform diameter which are of a slowly varying nature. Once the 
process “equilibrium” has been established, the noise (diameter vari- 
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Fig. 1—Geometry of the optical fiber drawing process. 
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ation) which appears on the fiber can be characterized as band-limited 
noise riding on a slowly varying bias. 


1.2 Identification of process dynamics 


Since the fiber diameter is related directly to the two manipulable 
variables V; and V, by the mass conservation principle, and only 
indirectly to other manipulable variables such as heat source temper- 
ature or heat flux, these are the system variables through which a 
control signal can most effectively be coupled into the process. Exper- 
imental results show that the dynamic response of the fiber is two 
orders of magnitude faster with respect to the drawing velocity than 
to the feed velocity. Consequently, the control loop shown in Fig. 3 
has been determined to be the most effective means of controlling 
fiber diameter. 

The motor-drawing mechanism is typically a pinch-wheel device in 
which one wheel is driven by a DC motor. The dynamic response of 
the mechanism used in this investigation is modeled as 


V(s) | 0.31 meters/second 
u(s)  0.0045s? + 0.030s + 1 volt 


(2) 
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Fig. 2—Diameter profile of uncontrolled fiber with constant feed and drawing speeds. 
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Fig. 3—Block diagram of diameter feedback control loop. 
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where u is the control signal to the motor. The response of the motor- 
drawing mechanism is dominated by the inertia of the pinch wheels, 
while the drawing tension on the fiber has little effect on the response 
of this component. 

The drawing process is a nonlinear, distributed process. Since the 
diameter is to be regulated about a fixed set point, a linear perturbation 
model can be used to represent incremental system dynamics. The 
parameters for this model, as well as the model structure, must be 
determined experimentally. To obtain response characteristics, the 
drawing mechanism is excited with a sinusoidal perturbation and the 
change in draw speed and fiber diameter are measured. Using the 
technique of Fourier filtering,® the relative gain and phase of the 
drawing mechanism and the process diameter response at each exci- 
tation frequency can be determined, and a model can be constructed 
from the data. It is found that, for nominal velocities up to 1 m/s, the 
process can be modeled by 


D;/2V5 


ee eee 
i{s) aps’ + bys +1 


us(s) + n(s), (3) 
where d; and vu; now represent incremental changes in fiber diameter 
and velocity, D; and V; represent the nominal process values, and ap, 
b, are the parameters representing process dynamics. The source of 
diameter variations, n, is a band-limited, Gaussian process with E{n} 
= 0, E{n?} = o2. The parameters a, and b, are sensitive to certain 
process parameters and insensitive to others. They have been found to 
be insensitive to preform diameter, fiber diameter, draw velocity, and 
the temperature of the heat source over the ranges 


1950°C < T; < 2150°C 
7mm <D, <19mm 
Om/s <V; <1m/s 

80 um < D; < 125 pm. 


The response is generally insensitive to changes in preform feed 
velocity over the range of speeds commensurate with the above values 
of D;, V;, Dp, but is known to change for draw velocities higher than 1 
m/s. The dynamic response parameters are also sensitive to the length 
of the heat zone as shown in Fig. 4. The long heat zone was obtained 
in a furnace and the short heat zone with laser heat source.* 

Due to physical limitations, the fiber diameter is measured at some 
point below the heat zone. Using a forward scattering interference 
fringe counting technique,’ the fiber can be measured with an accuracy 
of 0.25 um at a rate of 1000 measurements per second. Due to the high 
measurement rate, the measurement process has no dynamic response 
to contribute to the loop dynamics. As a result of the digital fringe 
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Fig. 4—Variation of process parameters with the length of heat zone. 


counting technique, however, a quantization noise, gm, perturbs the 
control loop. The effect of g,, can be seen from Fig. 3 to be equivalent 
to a loop with perfect measurement and a set point which changes + 
Gm at random times. The effect of the noise on the fiber diameter 
depends upon the rate at which the step changes occur. Experience 
has shown that the quantization noise becomes significant only when 
the standard deviation of other system disturbances have been reduced 
to the level q,, and represents a lower bound of achievable performance 
for the drawing system. 

Since the measurement process must be physically located beneath 
the heat source, it must also be located some distance away from the 
point, or region, where the diameter of the molten zone changes in 
response to variations in the drawing velocity. The distributed nature 
of the process leads to the expectation that the response to a step 
change in V; would result in a distributed change in the boundary of 
the molten material, as well as a change in fiber diameter. Experience 
indicates that small perturbations in draw velocity result in diameter 
variations in the molten zone which are confined to a very small region, 
near the point where the fiber is formed. Consequently, a “point” at 
which the molten material changes diameter in response to changes in 
draw velocity can be defined, and a measurement time delay, T, results 
which relates the delay distance and nominal drawing velocity. 

The resulting model of the measurement process is 


dn(S) 
d;(s) 
where the quantization noise has been reassociated with the diameter 


set point. The delay time is typically 0.04 to 0.10 second for a drawing 
velocity of 1 m/s. 





= 0.040 e~*? volts/um, (4) 
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ll. DESIGN OF THE CONTROL 


The control problem is to reduce the effects of the noise source, n, 
upon the fiber diameter d;. The relationship between the two quanti- 
ties when the feedback loop of Fig. 3 is used is given by 

db dy(s) _ 1 
n(s) 1+ G(s) 


where 6 dy(s) = d;(s) — da, and G(s) is the forward transfer function 


dy(s) uf(s) __uls) dm(s) — da 
u(s)|_, us) dm(s) — da dj(s) 


= H(s), (5) 


G(s) = (6) 





The form of the control circuit is 


u(s) = K. acs + 1 


dn(s) -da 8 b8+1° 2) 





The integral term is required to remove the slowly varying compo- 
nents of the noise source, and the lead-lag term is used to shape the 
response curve. Using this control circuit and the component responses 
(2) to (4), the response curve 


dd;(s) 
n(s) 


2 
S=Jw 











as a function of w is shown in Fig. 5. The response curve shows that 
low frequency disturbances can be effectively suppressed. There is a 
loss of control effectiveness as frequency increases to a point where 
the control loop has no effect upon the disturbances. In the region of 
the corner frequency w,, the noise is amplified. Figures 5 and 6 illustrate 
the role of the control circuit parameters upon the response curve. The 
suppression of low frequency variations is affected only by the loop 
gain. The degree of response peaking around the corner frequency is 
affected by the gain as well as the lead-lag network parameters. Some 
peaking resulting from high loop gain must be allowed, since it can 
only partially be compensated for by adjustment of a, and 6,. It has 
been found that +2 dB is an acceptable peak gain for the response 
curve, from the standpoint of the stochastic disturbances. 

The performance of the control loop can be optimized by choice of 
the parameters K., a-, 6. to minimize the performance index 


J = | |H(Jw) |? dw, (8) 
0 


subject to the constraint 
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Fig. 5—Sensitivity of control loop performance to loop gain. 
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Fig. 6—Sensitivity of control loop performance to lead-lag compensation. 
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max P 
{. = wi 0<o<| | H(jw) |? < 2 dB. (9) 


The performance index (8) is a measure of the relative noise power 
transmitted to the fiber diameter through the control loop. Minimi- 
zation of (8) is equivalent to minimizing 


j= B {ajo}, [n(-)J=1, . (10) 


for n(-) modeled by white noise. This relationship is established by 
the relationship between the power spectral densities 


Tu, = | H(jw) |? Tn, (11) 


resulting in 
J, = | Tg, dw = 202 J. (12) 


Due to the constraint (9), which seeks to reduce the effect of periodic 
disturbances upon the loop design, optimization is most effectively 
performed in the frequency domain. 

The optimum performance for a typical set of drawing system 
parameters is shown in Fig. 7, for the case of two measurement delay 
times. Referring to Figs. 5 to 7, of all the system parameters, the 
effectiveness of the control loop is most sensitive to loop gain and the 
measurement delay time. Examination of the phase characteristics of 
the system components reveals that over the range of controllable 
disturbances the measurement delay contributes the largest phase lag 
to the loop dynamics. Consequently, it is imperative in drawing system 
design to minimize the measurement delay. The loop response is 
sensitive to loop gain because the destablizing tendencies of the mea- 
surement delay pull the root loci toward the jw axis.° 

To determine the effectiveness of the control, account must be taken 
of both the process-related noise, n, and the measurement quantization 
noise qg. The fiber diameter variation is related to these two quantities 
in the frequency domain by the expression 


dd;(s) = H(s)n(s) + G(s) H(s)q(s). (18) 
The power spectral densities are related by 
Paw) = | HG) |? Tr) + | FG) |?T,@), (14) 


where F'(s) = G(s)H(s). The noise generating process associated with 
the fiber drawing process has been experimentally determined and can 
be modeled as 
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Fig. 7—Sensitivity of control loop performance to measurement time delay. System 
parameter values are a, = 0.0032, b, = 0.080. 
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I’, (w) = (15) 


w? + wr’ 

where w, is the corner frequency of the noise spectrum. 
The quantization noise can be modeled as a random telegraph signal’ 

for which the power spectral density function is 


Agi 
NM + 27w”’ 


where A is the mean number of switchings per unit time. The control 
function can be approximated as 


Dy(w) = (16) 


2 
igo) tS 
| (Jo) | aw + we 
2 
: We 
| FGe) "= (17) 


where w- is the corner frequency of the closed loop transfer function. 
The variance of the fiber diameter is then determined by computing 
the auto-covariance function® with zero lag: 
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Equation (18) illustrates an interesting tradeoff. Whereas it is desirable 
to reduce the process-induced fiber diameter variation by increasing 
w- insofar as possible, it is done at the expense of allowing additional 
quantization noise to affect the fiber diameter. The latter source may 
only be reduced by increasing the resolution of the measurement 
system. 

Experimentally, the effectiveness of the control loop is illustrated in 
Fig. 8 by the distribution of mean and standard deviations measured 
on 500-m lengths of controlled and uncontrolled fibers, after the 
process has reached equilibrium. The mean diameter of the uncon- 
trolled fibers varies over a 4-1~m range due to variations in preform 
diameter, nominal drawing velocities, etc. Using the feedback control, 
the mean diameter is held to within 0.1 wm of the set point. This 
_ deviation from set point is due, in part, to the 0.25 4m measurement 
resolution. The standard deviations of uncontrolled fibers are distrib- 
uted over a wide range with the majority of the samples having a 
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Fig. 8—Measured statistical characteristics of controlled and uncontrolled fibers. 
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standard deviation of 0.8 um. Use of the feedback control improves 
fiber quality with the result that standard deviations below 0.30 um 
can be repeatably attained. 

Using experimentally determined system parameters, the controlled 
variance of the fiber diameter given in eq. (18) can be compared to 
experimental values. If w, = 0.02, o, = 0.8 wm, gn = 0.25 um, A = 5, and 
from Fig. 7 w. = 0.2, eq. (18) yields a theoretical standard deviation of 
0.27 um. This value is very close to the median of the experimentally 
measured sample standard deviations. 


Ii. CONCLUSION 


Optimization theory has been applied to the optical fiber drawing 
process, resulting in a diameter feedback control loop which effectively 
reduces fiber diameter variations. In addition to examining the sensi- 
tivities of control loop performance to system component parameters, 
the sensitivities with respect to the noise sources were also examined. 
The expression derived to describe system performance with respect 
to process-induced diameter variation and measurement quantization 
noise showed good agreement with experimental results. 
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On the Structure of Real-Time Source Coders 


By H. S. WITSENHAUSEN 
(Manuscript received January 30, 1979) 


The outputs of a discrete time source with memory are to be encoded 
(“quantized” or “compressed’”’) into a sequence of discrete variables. 
From this latter sequence, a receiver must attempt to approximate 
some features of the source sequence. Operation is in real time, and 
the distortion measure does not tolerate delays. Such a situation has 
been investigated over infinite time spans by B. McMillan. In the 
present work, only finite time spans are considered. The main result 
is the following. If the source is kth-order Markov, one may, without 
loss, assume that the encoder forms each output using only the last k 
source symbols and the present state of the receiver’s memory. An 
example is constructed, which shows that the Markov property is 
essential. The case of delay is also considered. 


1. INTRODUCTION 


The outputs of a discrete time source with memory are to be encoded 
(“quantized” or “compressed’’) into a sequence of discrete variables. 
From this latter sequence, a receiver must attempt to approximate 
some features of the source sequence. Operation is in real time, and 
the distortion measure does not tolerate delays. Such a situation has 
been investigated over infinite time spans in Ref. 1. In the present 
work, only finite time spans are considered. 

The main result is the following. If the source is kth-order Markov, 
one may, without loss, assume that the encoder forms each output 
using only the last k source symbols and the present state of the 
receiver's memory. 

An example is constructed, which shows that the Markov property 
is essential. 


ll. THE MODEL 
2.1 The causal structure 

A source produces a random sequence Xj, X2, --+, Xr where for each 
te {1, ---, T}, X: is a vector in m;-dimensional real space. The source 
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is characterized by the sequence distribution: A given probability 
measure on the Borel sets of the product space of dimension )-17:. 

For each ¢, there is an opportunity for noiseless transmission of a 
signal Y, taking gq; possible values. This signal is produced from the X 
sequence by an encoder. As we consider the problem in real time, 
causality allows the encoder at ¢ to see only the values X, ---, X:. The 
encoders are thus characterized by functions f: R"*°"°*™" — {1, «+>, 
qi}, Borel measurable, ¢ = 1, ---, T. 

At the receiving end, the most that could be accessible at stage t¢ is 
the subsequence Yj, ---, Y: However, we also want to consider the 
case of limited memory, as the receiver might not be able to store this 
whole sequence for large t. The model will be the following. 

At t = 1, only Y, is available, and a discrete variable Z; = ri(Y:) 
taking m, values is stored in memory. For each ¢ > 1, the memory is 
updated by 


Lt = r(Ze-1, Y)), t= 2, sere! Mean 
where Z, takes values in {1, ---, m,} and 
re (1, +s, m1} X (1, ++, Qe} > {1, +++, me} 


is the memory update function. 
The purpose of the receiver is to generate a variable V; in R®* by 


Vi = g:(Y)), 
where gi: {1, «++, qi} > R*, 
and for ¢>1 
Vi = E(Zr-1, Yo), 
where 


8: (1, +++, Mea} X {1, +++ qi} > RB”. 


The interpretation of V, is that it represents an approximation to 
something we wish to know at the receiving end about X,. In particular, 
one may have s; = n; and consider V; as approximating X, itself. 

The functional relationships described above are symbolized in 
Fig. 1. 

The case of full receiver memory is included in this model. One need 
only identify Z, with (Yi, ---, Y, and ™ with the concatenation 
function “append.” 

Furthermore, in this case, 


t 
m= |] Qe. 
k=1 
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2.2 The criteria 


The performance of the system is defined by way of a sequence of 
distortion functions. For each ¢, a Borel measurable function 


we R™ X R*— [0, 0) 
is given. Then 
dt = E(¥AX, V;)} 


measures the distortion at stage ¢. It is possible that J; be infinite, but 
it is always well defined, as the composition of Borel functions is Borel 
and VY, = 0. 


2.3 The optimization problem 


The problem to be considered is the following: 

Given: The integers: T; m1, --+, 7; Qi, -**, GT) mM, °++, M7; Si, °°, Sr. 
The distribution of the X sequence. 

The distortion measures YW, ---, Yr. 

Choose: The functions fi, ---, fr; £1, -++, 27; 11, «++, rr (the latter are 
redundant in the full memory case, i.e., when m, = qige +++ q: for all 
t). A choice of a system of such functions will be called a “design.” 
In order to: 

Minimize (exactly or within «) the sum 


T 
J= VI. 


t=1 





Fig. 1—General system. 
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Remark that nothing would be gained by having J as a nonnegative 
linear combination ¥ c:J, (for instance, with c;=e™, a discount 
factor) because such c; = 0 can simply be absorbed into the definition 
of ¥;. 

It should be said that the freedom of having mn, gr, mi, s:, % depend 
upon ¢ is not a matter of extra generality, but is essential to the proof 
techniques used in the sequel. 

A design producing the values (J, ---, Jr) is at least as good asa 
design producing (Ji, ---, Jr) when / = J/ for all t € {1, ---, T}. 
This, of course, implies the much weaker statement that J= ) is J’ 
=) JS. 

A design may exist which is at least as good as any other; it is called 
a dominant design. In general, however, no dominant design exists 
because the set in R’ of achievable vectors (Ji, ---, Jr) does not have 
a corner (Ji, -+-, J) such that all other points of this set lie in the 
shifted orthant J; = J7, t = 1, ---, T. Instead, the set may have a 
Pareto frontier of “admissible” vectors, i.e., vectors (Ji, ---, Jr) such 
that no vector (J/, ---, J’r) is achievable that has J? < J; for all ¢ with 
strict inequality for some ¢. 


2.4 Special encoder structures 


The encoder f; at a specific stage t > k is said to have memory structure 
of order k, if there is a Borel function 


fi: (1, e+ +, mea} & Brees (1, ee, ge 
such that 
f(Xi, «++, Xi) = fi(Zrr, Xan, +++, Xd) a.s. 


This is equivalent to the assertion that Y; is measurable on the o-field 
generated by Z;-1, X:-x41, » ++, Xz. In other words, the encoder elaborates 
Y; using only the k most recent source outputs X;_4+1, «++, X: and the 
receiver’s current memory Z;-1. 


lil. THE MAIN THEOREM 


The sequence Xi, X2, ---, Xr is said to be kth-order Markov, when, 
given any block of k consecutive X;, the parts of the sequence preceding 
and following this block are conditionally independent. For k = 1, this 
is the ordinary Markov property. Note that the kth-order Markov 
property holds in a vacuous way if T< k + 2. 

Most sequences can be approximated by kth-order Markov se- 
quences for sufficiently large k. If this k is small compared to T, then 
the following main theorem provides a substantial simplification of the 
encoder optimization problem. 


Theorem 1: Suppose the source ts kth-order Markov. Then, given any 
design, there is another design with the following properties: 
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(t) The new design differs from the given one only in the choice of 
encoders. 

(tt) All encoders of the new design have memory structure of order 
k. (The last encoder fr can even be made to have memory structure of 
order 1.) 

(uit) The performance index J of the new design does not exceed 
the index of the old design. 


Postponing the proof of Theorem 1 to Section V, we comment here 
on its significance. It says, in particular, that for a Markov source and 
a receiver with perfect memory, one need only consider encoders which 
generate each code symbol Y, using only the current source symbol X, 
and the past code sequence Yi, Yo, ---, Y:-:. This result is essentially 
dependent on the Markov property of the source as can be seen from 
the following example. 

Take T = 3 and, for ¢ = 1, 2, 3, let n: = s, = 1, qr = 2, (Xs, Vi) = (X; 
— V,)?. Suppose the receiver has perfect memory. Suppose that the 
source sequence (Xj, X2, X3) takes just eight equally probable values, 
namely (13, 1, 3), (12, 1, 2), (11, 1, 1), (10, 1, 0), (—10, —1, 0), (—11, -1, 
1), (—12, —1, 2), (—18, —1, 3). 

At the first stage, if one considers only the minimization of J;, one 
has a classical quantization problem for X;. As X; takes its values in 
two separate equiprobable clusters, the minimum of J; is attained by 
letting Y, signal the sign of X, to identify the cluster. Then V; = +11.5 
and J, = 1.25. Any other choice of the first encoder yields a strictly 
larger value for J;. Furthermore, Y, is already sufficient for the 
attainment of J2 = 0, the second-stage receiver need not even look at 
Y2. However, Y2 can be used to help the third-stage receiver. If one 
lets Y2 signal the parity of Xi, then J; = 0 is attainable by letting Y3 
signal whether | X3| =< 1 or not. 

The design so obtained minimizes -J; for each ¢ (it is “dominant’”); a 
fortiori, it minimizes J = Yj Ji, giving J = 1.25. However, the second- 
stage encoder does not have memory structure of order one. 

Is it possible to achieve J = 1.25 with memory structure of order one 
although the source is not Markov? The answer is no for, if one 
changes the first-stage encoder, this alone will drive -J; and, a fortiori, 
J above 1.25. But if the first encoder signals the sign of X; and the 
second encoder must have first-order structure, then the second en- 
coder is useless. Indeed, X2 contains no information not already con- 
tained in Y,, and the receiver remembers Y,. Now Yj, is useless to the 
third-stage receiver,t and a single binary signal Y3 is insufficient to 
distinguish among the four possible values of X3. The best that can be 


f+ Yi = sgn X; and X3 are independent. 
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done is to form Y3 as in the previous design, giving -J3 = 0.25; hence, 
J = 1.5. 

The optimum design requires encoder f2 to “signal ahead” features 
of X, for the later benefit of receiver g3. This phenomenon is ruled out 
for sources with the Markov property. 


IV. TWO BASIC LEMMATA 


All the results in this paper will be derived from two basic lemmata: 
a “two-stage lemma” and a more complex “three-stage lemma.” Once 
these are obtained, the use of induction and of the technique of 
“repackaging” random variables will suffice. 


4.1 The two-stage lemma 


This lemma uses what is, in fact, the basic line of reasoning in Ref. 
1. Consider a system with T = 2 and any joint distribution of the pair 
of random vectors (X1, X2). Observe that the content Z; of the receiver’s 
memory at the beginning of stage 2 is a certain function of Xj; that is, 


A = $(X)), 


where ¢ is a Borel function (in fact, it is the composition of f; and 7). 
The second (and last) stage is characterized by the functions /2 and ge 
with (Fig. 2) . 


Y2 = R(X, Xa), 
V2 = 82(Zi, Y2). 


Lemma 1: Given a two-stage system with a design in which fz does 
not have memory structure of order 1, one can change f2 (and only fe) 
so that it has this structure and the new design is at least as good as 
the given design. 


Proof: If only f2 is changed, then J, ¢, and gz remain as given. We 
have to show that, for a suitable change in fo, J2 can only decrease. 
Consider the function 


F'((Zi, X2), Yo) = Wo(Xe, g2(Zi, Y2)). 


As Y2 is discrete and F is measurable (by its construction), a measur- 
able function f2 exists (see the appendix) such that 


F((Zi, X2), fo(Zi, X2)) < F((Zi, X2), Yo) 
for all values of Z,, X2, Yo. Hence, by the substitution 
Z = $(X1) 
Yo = fo(M, Xe) 
fo(X2, 82(6(X1), fo((X1), X2))) S Yo(X2, 2((X), fo(X1, X2)) 
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Fig. 2—Two-stage lemma. 


holds for all X,, X2. As the functions ¢,, f/: and f2 are measurable, both 
sides of the inequality are measurable. Since they are also nonnegative, 
the inequality persists when taking the expectation of both sides, 
whether finite or not. This establishes that J2 can only decrease as 
claimed. 





Fig. 3—Three-stage lemma. 
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4.2 The three-stage lemma 


Consider a three-stage system (T= 3) with a Markov source. Assume 
that the last encoder f; already has first-order memory structure, while 
fo does not (Fig. 3). 


Lemma 2: Under the above assumptions, one can replace fz by an 
encoder f, having memory structure of order one, without increasing 
the total cost J = J; + Jo + ds. 


Proof: The first-stage cost J; is unaffected by changes in f2 and the 
effect of the first-stage design is to generate the receiver memory Z; as 
a certain measurable function Z,; = $(Xi), where ¢ is the composition 
rief;. By assumption, f3 can be written in the form 


Y3 = f3(Zo, Xs) 
where Z2 = ¥2(Z;, Yo). 
The cost incurred in the last two stages can thus be written 


Y2(Xo, 82(Zi, Y2)) 
+ W3(Xs, 83(re(Zi, Yo), fa(re(Zi, Yo), Xs))) 
= F(Z, X2, X3, Y2), 


defining the measurable function F. 
Consider now the conditional expectation 


E{F(Z,, X2, Xs, Y2)| Xi, X2}. 


Because X;3 is a finite dimensional random vector, a regular condi- 
tional distribution of X3 exists for any condition. In view of the Markov 
property, conditioning on the pair (X1, X2) is equivalent to conditioning 
on Xz only. Let P(dX3|X2) be a regular version of this conditional 
distribution. 

Then the conditional expectation under consideration can be written 


i P(dX3|X2)F(Z1, X2, X3, Y2), 


where Z; and Y2, which depend only on the conditioning variables X, 
Xo, can be treated as fixed. This integral defines a measurable function 
(nonnegative and possibly extended real-valued) 


G(Z), X2, Yo). 


For any choice of f2, the sum J2 + J3 will be given by the expectation 
of G. Note that X; enters G only by way of Z, and Yo. 

As in Lemma 1, a measurable function f2 exists such that, for all Z,, 
Xo and Yo, 


G(Zi, Xo, f(Z, X2)) = G(Zi, X2, Yo). 


1444 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1979 


Substituting Z; = $(X1), Ye = fo(%, X2) and taking the expectations of 
both sides of this inequality, implies, by the chain rule, that Jz + ds 
cannot increase when f2 is replaced by fo. 


V. PROOF OF THE MAIN THEOREM 
To begin with, the situation of the last stage is always a special one, 
as the following lemma shows. 


Lemma 3: For any source statistics and any design, one can replace 
the last encoder by one having memory structure of order one, without 
performance loss. 


Proof: The given T-stage system can be considered as a two-stage 
system, by setting 


xX = (Xi, Xo, Seley Xr-1) 


Xo = Xr 
LZ =Zr1= (X1) 
Y.= Yr 


fo(Xi, Xo) = fr(Xi, Xo, +++, Xr-1, Xr) 
&(Zi, Yo) = gr(Zr-1, Y7) 
Vi = (Vi, -++, Vr-1) 


V2 = Vr 

ete i. T-1 
W(X, Vi) = py W(X, V,) 

t=1 


Wo(X2, V2) = Wr(X7, Vz), 


which amounts to a change in notation. Of course, 


a substantial increase in dimension. : 
By Lemma 1, there exists an encoder /f2 which has the structure 


Yo = fe (Zi, Xo) 


and whose use does not increase J2. Reverting to the original notation, 
this corresponds to an encoder fr with the structure 


Yr = fr(Zr-, Xr) 


whose use does not increase Jr. As the other J; are unchanged, the 
lemma is proved. 
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The above fact is the starting point for the proof of the main theorem 
with k = 1. 


Lemma 4: The main theorem holds for k = 1. That is, for a Markov 
source and any design, one can replace the encoders by appropriate 
encoders having first-order memory structure without increase in the 
expected cost -J. 


Proof: Using backward induction, one can first replace fr by fr, as in 
Lemma 3. Now suppose the encoders for stages ¢ + 1, t+ 2, ---, T 
already have memory structure of order one. It must be shown that f; 
can be replaced by f; with such structure, without increase in expected 
total cost. To this effect, the T-stage system can be considered as a 
three-stage system, in which the third stage has first-order memory 
structure and the source is Markov, as follows. Set 


t-1 
XxX = (Xi, +++, Xi-1) (hus n= yi ni) 


=1 


Z, = Z-1 = o(Xi) 

X, = X, 

Y2= Y, 

Zo = Z, = F(L1, Yo) = r(Zia, Y)) 

V2 = Vi = 82(Z1, Yo) = g(Ze-a, Yi) 
bo(X2, Vo) = Y(X,, Vi) 


T 
X3 = (Xi+1, -++, X7), (x = DS n. 


t=t+1 


t=t+1 


T 
Y3 = (Yes, ve ey Y7), (4. = I a) 


T 
V3 >= (Vi+1; arias Vi) = &3(Ze, Y3), (« = > x) 


i=t+1 


The latter relation follows from the fact that each V,, @ > ¢, is given by 
26(Ze-1, Yo) and the variables Z,_1, Ys are known functions of Z,, Yr41, 
Yi+2, +++, Yr using the memory update functions. Then 


T 


W3(Xs, Vs) = YD o(Xe, Vo). 
6=t+1 
As the encoders for stages ¢ + 1, ---, T already have first-order 
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memory structure, their effect is to define an encoder 

Y3 = fs(Z2, Xs) 
because each of the Yo, 0 > ¢, included in Y3 is given by a function 
fo(Zo_1, Xe) and the variables Z_;, X» are known functions of Zs, X3; i.e., 
of Z:, X41, +++, Xr using the memory update functions and recursion. 


The given encoder at stage ¢ has the general form Y; = f:(Xi, Xo, «+>, 
X,-1, Xt) which translates to 


Yo = A(X, X2). 
The new source (X1, Xo, X3) is Markov since X; = (X, +++, Xx) and 
X3 = (Xt41, «++ Xz) are conditionally independent given X2 = X;, by the 
assumed Markov property of the original source. 
Thus, the three-stage system satisfies the assumptions of Lemma 2, 


and f, can be replaced without loss of total expected cost by fz, which 
has the structure 


Y2 = pL, X). 
This translates to an encoder 
Y, = flZr-r, X:) 


for the original problem. Since the notational changes do not influence 
total cost, the inductive step, and therefore the lemma, is proved. 


Note that the above induction is carried out down to ¢t = 2 because 
fi cannot help but have the desired structure, albeit trivially so. 
Turning to the case of general k, observe that the encoders for the 
first k stages have memory structure of order & in a trivial way, 
whatever their design, and for the last stage, Lemma 3 applies. Thus 
the conclusion of the main theorem holds for T < k + 1 trivially, as 
does the assumption on the source. Hence, assume T = k + 2. 
The essence of the proof is a “sliding block” repackaging of the 
source variables. : 
Let Xt = (X,, Xt+1; aah | Xt+k-1) 
for PS 1, 354,cF where T=T-k+123. 
Then the sequence (X, Xo, ---, X#) is Markov. For the variables, let 
Yi = (Yi, ies Yx), 
Y; = Ye+r-1, for t= 2, eee, T 
Zz, = Lt+hk-1; for t = 1, eee, T 
V; = (Vi, +--+, Vz), 


and Vi= Vien, fort=2,--.,T7 
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The functions relating these variables are written as follows: 
Y, = fi(&%) 
summarizes the action of the first Rk encoders, which will remain 


unchanged as they already (trivially) have memory structure of order 
k. For t > 1, 


Y, = fi(Xi, Xo +++, Xi) = frsn-1(Xi, +++, Xen-1) | 


where /; is not uniquely defined by this relation. It can be made unique 
and measurable by requiring (for example) that for 6 = 2, ---, ¢, the 
function f, depends on the argument Xo = (Xo, «++, Xo+z-1) only through 
its last component Xo+n_1. However, any measurable f, satisfying the 
identity is acceptable. 

_ The receivers are characterized by their memory updating functions: 


Z = 7i(V1) 
summarizes the recursive buildup of Z, from (Yi, ---, Ys) using 
r}, +++, rx. For t > 2, Fr is defined by 
Zi = Fe(Zr-1, Yo) = rean—i(Zern—2, Yrrn-1). 
Likewise, 
Vi = &(Y1) 
summarizes the action of the first k decoders (including their memory 
updating). For ¢ > 2, g; is defined by 
Vv. = E(Zr-1, Y,) = Br+nr—1(Zr+n—2, Yerr-1). 
Finally, ¥:(Xi, Vi) = Yea v(X,, V.) and for t > 2 
W(X, Vi) = Vere—1(Xere-1, Vern-1), 


where y; depends on argument X; only through the component X;44-1. 
Now Lemma 4 can be applied to this T' stage system with Markov 
source. Without increase in total cost, the encoders fh, ---, fr can be 
replaced by encoders f; with first-order memory structure, i.e., 


Y; = fZ-1, Xi); for t= 2; ere, 7. 


Expressing this in terms of the original variables, the functions f, for 
t=k+1,---, Tare replaced by functions f; satisfying 


Yisr-1 = fere—1(Zire-2, Xp, X141, e g¢ Xt+k-1) for t= 2, eee, T—k+1 
or equivalently 


Ye= f(Zr, Xi-pt1, °° *, Xe) fort=k+1,.---, T. 
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These /, exhibit memory structure of order k, so that the main theorem 
is proved. 


VI. THE CASE OF DELAYED DISTORTION MEASURES 


The basic model of Section II can be modified as follows for the case 
in which a delay of 6 > 0 steps is included in the definition of distortion. 
The first change is that the variables Vi, ---, Vs are simply not 
generated, the receiver spends its first 6 periods just accumulating 
observations of Yi, ---, Ys and updating its memory accordingly. 

The second change is that, for t > 6, distortion is measured by a 
function y,(X+_3, V:) whose expectation defines J;. The design objective 
is to minimize 

T 
J = Y J, te 
t=6+1 
For this situation, the following structure simplifying result holds. 


Delay Theorem: Suppose that the source is kth order Markov and 
that the distortion is defined with delay 8. Then any given design can 
be replaced, without loss, by one in which the encoders have memory 
structure of order max(k, 6 + 1). 


Proof: In case k = § + 1, one can perform the same transformation of 
the point of view as in the proof of the main theorem. Indeed, this 
transformation gives cost functions of the form 


WAX, Vi), t=1--,T-R+1, 
where 
Xp = (Xy +++, Xena), Vi = (Vi, «++, Va), Ve = Vern. 
This is compatible with the delay criterion, as follows: 


k-5 


W(X, Vi) = 2 W(X:, Vi+s) 


and fort = 2,---,T—k+1 
WAX, V,) = Wer r—-1(Xt4+R-5-1, Vi+n-1), 


where it happens that #, depends upon X; only through the component 
Xt+h—s-1: 

Therefore the argument of the main theorem applies: One can use 
encoders with memory structure of order k. In fact, the above shows 
that this conclusion is valid for any criteria of the form 


by W( Xz, X41, ery Xt+h-1; Visn-1). 
t 
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As for the case k < 6 + 1, observe that the source is then, a fortiori, 
Markov of order 6 + 1. Hence, the first case applies to yield memory 
structure of order 6 + 1, as claimed. 


Vil. CONCLUDING REMARKS 


A few extensions of the results are of interest. 

(i) The proof of the three-stage lemma goes through under the 
weaker assumption that f; depends upon Z2, X2, and X3. 

(it) All the results in this paper remain true for V; restricted to given 
subsets of R*. This would correspond to quantization levels fixed in 
advance, as opposed to their selection as part of the design. 

(iit) Suppose 6 = 0, k = 1, and the encoder is restricted a priori to 
be a finite state machine of the type 


WwW, = hi Wi-1, Xz), 
= fi(Wi-, X:), 


where W, is a discrete variable representing the contents of the 
encoder’s memory. Then the main theorem implies that it is optimal 
to take Z, = W, and h; = 7 since this simulation of the receiver’s 
memory produces the argument required for the generation of Y;. This 
result was obtained independently by N. T. Gaarder. 
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APPENDIX 


Let X be aset and # a o-algebra of subsets of X. Let Y be a finite set 
{1, ---, g}. A function F: X x Y — R is called measurable if for each 
yin Y, the function F(-, y): X — R is Z-measurable. 

Then it follows that a function f: X — Y exists such that 


F(x, f(x)) = F(x, y) 


holds for all x € X and y € Y and the function fis @-measurable (which 
means that {x| f(x) = y} is in # for each y). 

Since y takes only finitely many values, it is evident that, for each x, 
one can select an f(x) to satisfy the inequality. However, there may be 
many x for which the minimizing y is not unique. This creates the need 
for a choice of values in defining f, and if such a choice were made in 
a totally arbitrary manner, it is possible that the resulting f not be Z- 
measurable. What is needed is the (elementary) proof that, for a 
reasonable way to resolve ambiguous choices, the resulting f is auto- 
matically Z-measurable. 
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Given y € Y, consider the set A, of all x for which y is among the 
minimizing values, this set is measurable because it is defined by a 
finite number of inequalities among measurable functions, namely, for 
each y’€ Y, 


F(x, y) = F(x, y’). 


The sets A, cover X but with overlaps. To remove the overlaps, use 
the numerical indexing of Y to define 


B, =A, 


and, for y > 1, 
y-l1 
By, = Ay 7 U Aj. 
i=) 


This construction preserves measurability and removes overlap. Thus, 
if f is defined to take value y on B,, the desired result is attained. This 
amounts to stipulating that, when the minimum is attained for more 
than one element of Y, f(x) is defined as the element with the smallest 
label. 
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Experimental and theoretical results from our work on current- 
access technology show promise for high-density, ~10" bits/cm”, and 
high-frequency, f > 1 MHz, bubble devices. We have operated current- 
access devices where the bubble-driving fields derive from two pat- 
terned conducting sheets instead of orthogonal field coils. Margins 
for generation, propagation, and transfer were studied on 8-u~m 
periods at 1 MHz. These 8-um period structures typically required 1.5 
mA/pm per conducting sheet and dissipated 14 »W/bit. Single 
conducting-sheet, current-access circuitry also propagates bubbles 
but offers less design flexibility. We present design criteria, mag- 
netic field equations, and design curves. Implementation of these de- 
vices required new magnetic materials with quality factor Q com- 
parable to available garnets, yet higher mobility and lower dy- 
namic coercivity. Of the three systems, (YLuSmCa)3(FeGe);O., 
(LaLuSmCa)3(FeGe)5O12, and (LaLuSm)3(FeGa)sO., the last appears 
best suited; some room temperature characteristics of the composition 
Lao.6lLuei;.Smo3Gao9Fes.1012 are 4nM, = 470G, oS 750 cmS”'-Oe™', 
d = 1.6 pm, and AH. = 2 Oe across a bubble for the threshold of 
motion. Necessary improvements in processing were made with a 
radio-frequency, chlorine-containing plasma etch which produced 
metal patterns identical to those of the etch mask. We anticipate that 
current-access devices, when compared to conventional field-access . 
devices, will achieve higher data rates, lower power consumption per 
bit, and greater storage densities with existing processing technolo- 
gies. 


l. INTRODUCTION 


We have been investigating the potential of conductor-access bubble 
circuitry’ to meet the need for a high-density, high-capacity, low-cost, 
nonvolatile memory device. There are two basic ways to move bubbles: 
field-access” and current-access (conductor-access). Field-access de- 
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vices, the only type in use at this time, require a pair of orthogonal 
drive coils to provide an in-plane magnetic field which, coupled to a 
bubble data stream via structured permalloy or ion-implanted fea- 
tures,’ provides the necessary bubble drive forces. 

It is difficult, however, to take advantage of the data rates intrinsic 
in field-access bubble devices as they are scaled down in size. In 
particular, the volt-ampere product increases with increasing fre- 
quency, making economical coil drive circuits difficult to design. Pres- 
ent field-access devices are limited to about a 250-kHz rotating field 
rate and are thus too slow for applications such as television frame 
storage. A bit rate per chip of 1 Mb/s or greater would make bubbles 
attractive to a wider range of potential customers. 

In traditional current-access bubble circuitry, meandering conductor 
strips of relatively high resistance, rather than drive coils, make it 
possible to provide the necessary currents to move bubbles at stepping 
rates of 1 MHz and greater with relative ease but give rise to a concern 
that current-access chips would dissipate excessive power, especially 
at the higher bit storage densities such as 10’ b/cm?”. 

The disadvantage of the current-access circuits disclosed to date has 
been the complexity of the conductor patterns themselves. We have 
developed new configurations, however, that use rather simple aper- 
tured sheet geometries and which should dissipate less power as well. 
Bubble circuits with an 8-um period, a four-times improvement in 
storage density over present circuits, are expected to be achieved while 
maintaining the same design rules of the 16-um period field-access 
circuits now in manufacture at Western Electric, Reading.‘ 


ll. HISTORY OF CURRENT ACCESS DEVELOPMENT 


It is not surprising that the very first experiments to control bubble 
movement used conductor drive. In these first experiments, adjacent 
posts of a “waffle-iron” ferrite base plate were looped by conductors as 
shown in Fig. 1.’ Conductors 1, ¢2, and ¢3 driven by unipolar current 
pulses produced bubble motion with the direction determined by the 
pulse sequence. Bubble domains had to overlap from one post to 
another before proper operation could be obtained. 

A continued evolution in circuit design next led to the single con- 
ductor design of Copeland et al.,” shown in Fig. 2. Note the conductor 
which meanders back and forth across a groove which has been etched 
into the surface of the epitaxial garnet layer. Undulations in the width 
of the groove define preferred bubble rest positions which supplement 
a bipolar current drive to produce bubble motion. It is a basic require- 
ment of this design, however, that a rather precise geometrical fit be 
maintained between the bubble and the groove pattern. The ideal 
bubble diameter is, in fact, one-half of the conductor period making 
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Fig. 1—Schematic of the first bubble shift register circuit. Bubbles in orthoferrites 
were moved three steps with a waffle-iron base plate. 


bubble-to-bubble interactions prohibitive at the maximum storage 
density and hence the circuit could only be operated with data in every 
other position. A variation in which grooved tracks were replaced by 
ion-implanted tracks of similar shape was also operated.* About this 
time, effort in conductor propagation was terminated in favor of “‘field- 
access” propagation. 

Conductor propagation re-emerged in 1974 when Walsh and Charap*® 
proposed a perforated conducting sheet driven by either a rotating or 
oscillating current as a novel bubble drive. Their computer simulation 
indicated that operation comparable to that of field-access devices 
could be accomplished. Their paper went unnoticed. 

In 1975, the bubble lattice storage device was announced by Voegeli 
et al.’ Conductors were used to move bubbles in the main storage area, 
the entrance-exit line, and the detector. In the most recent paper on 
the bubble lattice, Hu et al.® state “Ironically, the “simple” shift 
register propagation of an isolated-bubble has one of the smallest 
margins in the chip.” Hu et al. were somewhat handicapped in their 
design since bias field compatibility had to be maintained with the 
main storage area. Nonetheless, their report was not encouraging. 

Then in 1977, Dekker et al.,° Phillips Research Labs, introduced a 
design which combined conductor and field access on a major-minor 
chip. They replaced the usually field-driven major track by a conduc- 
tor-driven track that operated at a ten-times-higher stepping rate. In 
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this way, they achieved a fivefold reduction in access time. Typical of 
the Dekker designs is that illustrated in Fig. 3. The “conductor” is in 
reality a dual-layer, permalloy-gold sandwich in which the gold layer 
has been included to lower the sheet resistance and, consequently, the 
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Fig. 2—An SEM of a turn of a conductor-groove circuit. The circuit period is 16 pm. 
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Fig. 3—A current-driven major track. An in-plane field generates poles at points of 
the “conductor,” which is in reality a conductor-permalloy sandwich. 
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drive dissipation. A nonrotating in-plane field, supplied by the very 
coils that operate the field-access portion of the chip, magnetizes the 
triangular permalloy points, converting them to low energy resting 
positions for bubbles. The primary drive for bubbles comes from areas 
defined by the conductor loops driven by a bi-polar current to alter- 
nately attract and repel bubbles. As a result, bubbles step from loop to 
loop but always move to the right because of the directionality induced 
by the triangular points described above. Bubbles attracted to these 
points offset to the right from their otherwise neutral position within 
a loop. 

Dekker reported a power dissipation of 0.6 mW/b for this structure. 
It is thus difficult to conceive that their design could be used in an all- 
conductor chip, since the power dissipation for even a 100-kb chip 
would be 60 watts. We see in Section VII that chip partitioning can be 
used to reduce the dissipation considerably, but unlikely, in this case, 
to a useful level. Copeland’s design, on the other hand, did operate at 
very low power per bit, but it was both wasteful of storage area and 
difficult to process. It was difficult to process because its minimum 
feature dimension was a very small portion of the circuit period. 

This, then, was the reported status of current access bubble circuits. 
The expected advantages of higher data rates, ease of driving, and 
simple package design had not been realized. 


lil. APERTURED SHEET FUNDAMENTALS 


It will be helpful to the understanding of the device structures 
covered later if we digress at this point to some design considerations 
peculiar to apertured sheet devices. First, let us consider some analo- 
gies that exist between the more familiar permalloy structures and the 
conductor structures that we introduce in this and the following 
sections. 

An analogy exists between the stray field of an isolated permalloy 
bar immersed in a uniform magnetic field H and the stray field 
generated by the distortion of an otherwise uniform current J in a 
sheet conductor with a slot.° In either case, an isolated magnetic dipole 
is formed. This situation is shown in Fig. 4, where it can be seen that 
a “‘plus”-pole-seeking bubble would be attracted to the upper opening 
of the slot just as it would be attracted to the upper end of the 
permalloy bar. Note also that the directions of the magnetic field H 
and the current J are at right angles while producing identically 
oriented dipoles. We must be careful not to carry this equivalence too 
far, since the field H is applied over a volume, whereas the current J is 
confined to a plane. The practical significance is that a slot can 
completely block a current whereas a permalloy bar cannot completely 
shunt an applied field. 

Next, it is useful to define a sheet current density J(mA/pm) =//W, 
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where J(mA) is the current applied to a sheet of width W(um). We can 
relate current density in conductor devices to inplane field intensity in 
permalloy devices by observing that an infinitely wide conductor 
carrying a current density J = 1 mA/um supports an in-plane field of 
6.3 Oe. This is illustrated in Fig. 5. 

There are, of course, an infinite variety of aperture profiles that can 
replace the slot of Fig. 4. A number that we have considered are shown 
in Fig. 6. By fabricating large-scale models, energizing these in a left- 
to-right direction from a 30-kHz current source and then measuring 
the signals induced in a uniformly wound single-layer coil scaled in 
dimension to that of a hypothetical bubble, we obtained the z-field 
contours also included in the figure. The z-component is that field 
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Fig. 4—Magnetic field H applied to a permalloy bar results in a dipole field. Current I 
applied to a conducting sheet produces a similar field. 





Fig. 5—-A conducting sheet, infinite in extent, develops a 6.3-Oe surface field when 
driven at J = 1 mA/pm. 
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Fig. 6—Contours proportional to the z-component of field measured on scale models are given for apertures (a) through (h). Current 
flow was from left to right. 


component normal to the sheet itself. The results actually apply to a 
square array of identical apertures positioned at center-to-center sep- 
arations of A. Only one quadrant of field is shown, as the others follow 
by symmetry considerations. Especially note how closely the field 
peaks and valleys are confined to the vicinity of the apertures. The 
oversized elements in the lower row give rise to larger fields than the 
shorter elements of the upper row, but at the expense of a higher 
power dissipation. We find the shapes “‘b,” “f,” and “h” optimal when 
items such as the ease of fabrication are also considered. Further 
details on this technique are given in Section X. 

Similar results can also be obtained analytically and we have, in 
fact, devoted most of Section X to this end. One advantage of current- 
access bubble propagation is that they can be given a more precise 
mathematical treatment than is possible with field-access devices. 
Non-linear, permalloy-to-bubble interactions are not involved except, 
perhaps, in the detector itself. For example, we have calculated the 
field contours caused by current flow past a circular hole in an 
otherwise continuous infinite sheet. Two of the resultant design curves 
are given in Fig. 7, where hf is the garnet thickness, a is the hole 
diameter, and Z, is the elevation of the conductor sheet above the 
upper surface of the garnet. These contours give the average field in 
the garnet for an applied current density of 1 mA/um. Conceptually, 
the role of the hole can be viewed as converting an otherwise uniform 
in-plane field (Fig. 5) into a bipolar field at about a 50-percent effi- 
ciency. This compares favorably to the conversion efficiency of a well- 
designed permalloy feature, which is also approximately 50 percent. 

Next we consider the properties of an array of parallel conductor 
strips such as we encounter when detectors are discussed. The analyt- 
ical results shown in Fig. 8 have been derived from design curves 
located in Section X. The parameters apply to a circuit period of 8 um. 
For J = 1 mA/ptm, the peak-to-peak bias field modulation is 9 Oe, 
meaning that a bubble “riding” in the field trough will have its bubble- 
to-strip transition field increased by 4.5 Oe. In an expander-type 
detector, it will be desirable to increase this transition field to at least 
15 Oe so we can conclude that the current density in the detector will 
be increased to ~4 mA/pm. In conductor circuits, there is ample 
opportunity to do this. 

We now present a problem peculiar to the apertured sheet geometry. 
Current in the conductor sheet not only generates the local field 
inhomogenities due to the apertures and an overall inplane field, but 
it also generates a gross z-field. The situation is shown in Fig. 9, where 
the field values pertain to J = 1 mA/pm. As might be expected, this 
field peaks at the edge of the conductor. As a consequence, any bubble 
located near the edge of a driven conductor sheet experiences a wide 
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Fig. 7—Calculated z-field contours in the vicinity of a circular hole for a current density J = 1 mA/pm. 
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Fig. 8—Hz variation generated by an array of parallel conducting strips. The dimen- 
sions are typical of an 8-um period expander detector. 
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Fig. 9—Curve of the z-field due to a uniform current density J = 1 mA/ym flowing 
in a conducting sheet of width 2a. 
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bias change. This problem is substantially alleviated if active bubble 
circuits are confined to the central 80 percent of a sheet. Other 
solutions such as the use of a return conductor can also be employed. 

Finally, let us contrast the performance of apertures and meandering 
conductors with respect to field localization and power efficiency. A 
comparison between the field contour of the undulating conductor 
given in Fig. 10 and those of Fig. 6 reveal that the fields of the latter 
more closely resemble those of a dipole. Furthermore, the fields 
produced by neighboring apertures are seen to be distinct and separate. 
Such is not the case for the former and, in fact, the edges of the 
conductor are biased to nearly one-half the peak field. 

The structure of Fig. 10 requires one-third the current/cell to 
produce the same peak field as that of Fig. 6f; however, its resistance/ 
cell is substantially higher. Consequently, there is little difference from 
a power dissipation standpoint. 


IV. ROTATING CURRENT 


In this conductor access scheme, the propagate elements are perfo- 
rations in a continuous sheet of conductor material deposited directly 
on the epi-garnet. Bubble propagation is accomplished by a pair of 





Fig. 10—Measured 2z-field contours for a meander-line conductor path. Note that the 
peak field extends toward the boundary of the conductor path. 
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bipolar currents applied in orthogonal directions to the conductor 
plane. This structure was first examined by T. J. Walsh and S. H. 
Charap.® Their computer simulations demonstrated the principle of 
operation. The obvious advantage over a field access device is the 
absence of field coils. This permits a simpler package and higher 
operating frequencies, which were previously limited by the field coils. 


4.1 Principle of operation 


A current is applied to a patterned conductor overlay. This current 
produces poles around the apertures. As the current is rotated in the 
conductor plane, the poles rotate about the apertures (see Fig. 11). In 
this figure, a chevron is the aperture (propagate element). As a current 
I passes from left to right, poles are established, positive on the bottom 
and negative on the top of the chevron. The bubble is attracted to the 
positive poles and will rest at the lower portion of the chevrons as in 
Fig. lla. In the following three illustrations, the current is shown 
rotated in 90-degree steps. As the current rotates, the poles it produces 
also shift, giving rise to bubble propagation. 


4.2 Experimental 


Circuits were fabricated with 16-~m and 8-ym periods. Because of 
the processing advantages of single-gap structures, most test elements 
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ROTATING CURRENT ACCESS 
Fig. 11—Principle of operation for a circuit using rotating current for the drive. 


Motion, (a) through (d), proceeds much as for a single-gap permalloy circuit where 
chevrons are driven by a rotating field. 
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were chevrons. Multiple elements, such as the T-bar or X-bar, require 
tighter line-width control and tax processing capabilities. Portions of 
the masks used to make the circuits are shown in Fig. 12. 

Circuits were patterned in a conductor film of 4500 A AlCu. Bubbles 
were viewed in reflection with the conductor sheet serving as the 
reflecting surface. Four drive pulses are applied in quadrature for 
propagation, and a typical pulse train is shown in Fig. 13. Some overlap 
of adjacent current pulses is required. The amount of overlap was 
varied to ascertain the best operating margins with the lowest power 
dissipation. It was determined that 33-percent overlap was optimum. 
Note the slots in the arms of the test structures of Fig. 13. The slots 
function to keep the current flow lines parallel. 

Both chevron and X-bar propagate elements were evaluated. The 
propagate margins for three chevron-like elements at an 8-um period 
are given in Fig. 14. The margins are for various bubble patterns in 
straight-line propagation. Tests with different bubble patterns gave no 
indication of bubble-bubble interaction or reduction in propagate 
margin. Propagation through a 90-degree turn of Fig. 12c was marginal. 
The X-bar structure of Fig. 12d did not propagate domains’ reliability. 


4.3 Status 


The high current densities required for propagation of 4 to 5 mA/ 
pm as well as the complexities of providing multiple current drives into 
a common load curtailed efforts on this single-conductor approach. 
Consequently, problems associated with the design of generators, 
transfers, turns, and detectors have not been addressed. 
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Fig. 12—Test masks used to evaluate rotating current structures. 
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Fig. 13—Details of the overall test circuit. The slots in the lead-in arms prevent the 
current from “pin-cushioning.” 


V. CURRENT-ACCESS PROPAGATION BASED ON A STATIC OFFSET 
FORCE 
In this section, we take an in-depth look at propagation circuits that 
can be driven by a single bipolar current source. The circuits of 
Copeland and Dekker discussed in Section II and those introduced in 
the following section, in which the meandering conductor is replaced 
by an apertured sheet, fall into this category. 


5.1 Description 


An excellent discussion of the static offset force problem has already 
been presented by Copeland et al.’° We expand their results to include 
the effect of the domain wall coercivity Hc. In the usual single con- 
ductor circuit, the current flows in a conductor that crosses back and 
forth over the bubble track as shown in Fig. 15a. When a bipolar 
current is applied to the conductor, it produces an approximate stand- 
ing wave field envelope: 


HA, = Nil sin 27 x/A, (1) 
where the X direction is along the bubble track and H, is a Z-directed 
bubble drive field (Fig. 15b). It is obvious that a bubble can respond to 
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Fig. 14—Operating margins for the three aperture designs indicated. 
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Fig. 15—Schematic layout of a single-conductor circuit. The conductor driven with a 
bipolar current (a) and the idealized z-field standing-wave pattern produced (b) are 
shown, as well as a double-frequency meander-conductor driven by a constant cur- 
rent (c). 


this standing-wave field pattern by either oscillating back and forth or, 
more likely, being ejected from the track altogether. Next add the 
static current Js; (Fig. 15c) which produces yet another field pattern 
but this time with a double spatial frequency. This field is of the form 
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Hs = NsIs cos (42x/)). (2) 


In eqs. (1) and (2), N; and Ns are current-to-field conversion factors. 

It is only illustrative, of course, that we use a static current Is to 
produce Hs. We have already seen that grooves in a garnet or permal- 
loy points can accomplish the same end and without an added power 
dissipation. If the dynamic (Hz) and the static (Hs) drive fields are 
properly proportioned, we achieve propagation as illustrated in 
Fig. 16. The ideal offset is \/8 with an absolute maximum tolerance of 
+ d/8. 


5.2 Conditions for bubble propagation 


Before we discuss the constraints that must be imposed on Hy; and 
Hs to obtain bubble propagation, it is helpful to review some bubble 
fundamentals. A field H applied to a stationary wall causes motion at 
a velocity v(cm/s) given by the equation 


v = w(H — A), (3) 


where p(cm/sec-Oe) is the wall mobility. In the case of a bubble, the 
change in field intensity across the bubble AH is used with the result 


V= . [AH — 8H./7]. (4) 


Since the bubble in the circuit (Fig. 15) experiences fields H; and Hs 
simultaneously, we combine (1) and (2) to obtain 


Hr = Ni sin 27X/X + Nels cos (47X/d). (5) 


There is no general analytical solution for the velocity of a bubble, 
even assuming H; sinusoidal and H. = 0. By graphical analysis we can, 
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| Fig. 16—The unidirectional propagation mechanism for the circuit of Fig. 15. 
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however, determine the minimum fields needed to overcome the 
coercivity H.. From these we can determine the minimum drive 
currents. The mobility can then be introduced as an iteration of the 
coercivity under the assumption that the bubble moves at constant 
velocity. 

The drive field H; and the static field Hs and their sum are plotted 
in Fig. 17. It is assumed that positive fields increase the bias-causing 
bubbles to seek locations at which the applied fields are most negative. 
We treat the case where the bubble diameter is one-fourth the circuit 
period A; however, the analysis can be extended to any diameter. Since 
mobility effects are being neglected, the condition AH 2 8H./z is 
sufficient to move a bubble. Actually, (4) is valid only for a linear 
gradient; however, we assume that any field difference at the extremes 
of the bubble diameter is equivalent to AH. 

We start with a bubble in position 1 of Fig. 17, where 1 defines the 
location of the center of the bubble and, initially, with H. = 0. 
Increasing H; from zero to the amplitude sketched on the figure shifts 
the bubble from 1 to 2 and then, as Hy; returns to zero, to 3. This 
completes a half-step of propagation. The other half-step takes place 
when H7 is reversed in sign. Next, we consider the conditions imposed 
on Hs when the coercivity H. is included. There are “dead spots” at Hr 
(peak), and it is the role of the static field Hs to dislodge bubbles from 
those positions. The minimum values of Hs for any movement what- 
soever is 


2Hs = 8H. 7. (6) 


Equation (6) therefore defines the critical minimum value for Hs. To 
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Fig. 17—Idealized z-fields H; and Hs for a single-conductor propagation circuit with 
a static offset field. Bubble motion is from 1 to 3. 
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deduce the critical amplitude of Hz, we set Hs = 4H./7 and assume 
that there is a bubble at 1. As H7 is increased from zero amplitude, the 
bubble will move to the right, eventually reaching 5. Now 5 is a 
minimum gradient location for the bubble and, to overcome the 
coercivity, the amplitude of H; must reach 


Hy, = 2V2Hs. (7) 


Once beyond 5, the bubble moves quickly ahead since it has excess 
gradient until it stops at 4, where again the field gradient is 8H,/z. 
When #7 returns to zero, the static field Hs completes this half-step by 
nudging the bubble a bit further to the right. 

In summary, our two design equations are 


Hs = 4H. T (8) 


and 


H, = 8V2H./n. (9) 


5.3 Methods to realize a static offset field 


It is beyond the scope of this paper to list all the methods proposed 
to produce a static offset field. Permalloy dots, permalloy points, and 
embossed garnet features, i.e., grooves, are the three most extensively 
reported. In the next section, two more approaches are added to that 
list. They are (i) permalloy features contoured in the z-direction and 
polarized by the bias field and (iz) ion-implanted low-energy bubble 
rest positions produced by selectively implanting zones in the surface 
of the garnet. The remainder of this section, however, is devoted to 
“grooves,” as they are representative of the other approaches. 

In Magnetic Bubbles by O’Dell,'' the equivalence of bubbles and 
current loops is discussed at length. This analogy serves as an excellent 
introduction to the subject of grooves. It can be shown that the stray 
field of a bubble and a loop current are identical if the bubble is 
replaced by a loop current Jz = 2M,h, where the units are mA, gauss 
and pm, respectively. As an example, if h = 1.5 um and M, = 40 gauss, 
parameters typical for an 8-um device, then Jz = 120 mA. Shown in 
Fig. 18a, the “loop current” representation aids in the comprehension 
of bubble behavior. 

For a cylindrical hole in a magnetic layer, a similar equivalent 
current is I, = M,h, which, for the parameters above, is 60 mA. For 
partially etched holes, i.e., craters, the expression is I, = M,Ah (Figs. 
18b and 18c). Since an undulating groove is nothing more than a 
continuous chain of craters, the analogy can readily be extended to 
cover that configuration. 

We have also included the design curves of Fig. 19, which give the 
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Fig. 18—The loop current magnetostatically equivalent to a typical bubble found in 
8-j4m period devices is given in (a). Similar relationships hold for a hole (b) and a partial 
hole (c). 


field of a current loop located on the surface of a plate of thickness h. 
These curves will be useful if a more detailed analysis is attempted. 
They give the z-field averaged through the plate thickness h and 
normalized to the field at the center of the loop. 


VI. KEYHOLE AND KNOTHOLE CIRCUITS 


A class of bubble devices is now introduced, i.e., those driven by 
currents applied to apertured sheets rather than to arrays of conduc- 
tors. In this section, only those configurations with a single conductor 
sheet are described. In Section VII, those circuits with two conductor 
sheets are covered. 


6.1 Keyhole circuits 


The idea for keyhole circuits originated from considerations involv- 
ing the well-known “conductor crossing problem” in permalloy circuits. 
In particular, we were concerned with the role of the bias field. The 
bias field necessary to stabilize 6-uwm and 1-um diameter bubbles is 
about 180 Oe and 450 Oe, respectively. Consequently, the direct 
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Fig. 19—Z-field contours normalized to the field Ho at the center of a current loop. 
These curves are useful to characterize bubble-bubble as well as bubble-etched feature 
(embossed) interactions. 


influence of the bias field on the magnetic state of the permalloy 
features cannot be neglected in circuits with small bubbles, i.e., high 
density circuits. 

We use the bias field to produce a static offset field by the approach 
seen in Fig. 20. The field Hyias has a component H directed along the 
body of any permalloy feature inclined to the plane of the garnet. Such 
is the case, then, for a permalloy feature patterned at the edge of a 
tapered conductor. Magnetic poles that repel bubbles develop at the 
end of the permalloy feature nearer the garnet, while those that attract 
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Fig. 20—A permalloy element, located on the edge of a tapered conductor, polarized 
by a component of the bias field. 


bubbles develop at the end farther from the garnet. A reversal of the 
bias field direction will reverse the sign of the permalloy poles. How- 
ever, the poles of the bubble also reverse, so the interaction remains 
unchanged. It is relatively easy to approximate the magnitude of the 
local field perturbations presented to the bubble. These calculations 
show that peak fields of a few oersteds to tens of oersteds can be 
expected. 

Bubble propagation circuits capitalizing on this effect were designed; 
Fig. 21 is typical of those in which the apertures, squares fitted with a 
tab, are keyhole-like in design; hence, the name. Position and shape of 
the permalloy bar is seen in more detail in Fig. 22. For the reasons 
described above, bubbles are attracted to the ends of permalloy bars 
and repelled from their centers. Bubbles travel from left to right, 
moving from bar to bar in response to a bipolar current applied 
vertically to the sheet. They must “tunnel” through barriers as they 
travel from end-to-end on a given bar, but not as they jump from one 
bar to another. This asymmetry was apparent in the minimum drive 
currents of circuits that were tested. 

For an 8-ym period circuit operated at 500-kHz, pulse amplitudes 
were required of 11.2 mA/um to move along the permalloy and 7.2 
mA/pm to bridge successive elements. Construction details were: 0.10- 
pum SiO» prespacer, 0.28-4m AlCu, 0.23-um SiOz spacer, and 0.62-~m 
permalloy (ploy). With a garnet material that supported 1-u~m bubbles, 
the operating bias range was 495 + 17 Oe. A feature of this class of 
circuits is that they can accommodate a very wide range of bubble 
diameters. One foremost disadvantage is that they are difficult to 
process because of the intricacies of the aperture design. 
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Fig. 21—Bubbles in this keyhole circuit move from left to right in response to a 
bidirectional current drive. The tab within the hole elevates the end of the permalloy 
bar but does not interfere with current flow around the aperture. 
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Fig. 22—Perspective view of a keyhole aperture shows the contoured permalloy. 


6.2 Knothole circuits 


In a knothole circuit, ion-implanted islands are combined with a 
single-apertured conductor sheet to form a bubble shift register. Uni- 
form ion implantation of a garnet surface is the traditional method of 
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suppressing hard bubbles. Field-access bubble propagate circuits that 
use implanted patterns rather than permalloy elements are being 
considered for next-generation high-density applications. The uniaxial 
anisotropy of most garnet materials, and especially those treated in 
this paper, is lowered by any implantation that stresses the lattice. 
One can consider that bubbles couple to implanted areas because these 
areas tend to reduce the stray field energy by providing flux closure. 
The interaction, however, is more complex, as it can involve “charge 
walls” in the implant layer. Also, bubbles are especially attracted to 
the implant side of boundaries that partition implanted and nonim- 
planted areas. Phenomologically, an implanted island behaves like that 
of an area that has been thinned and as such can be replaced by a loop 
current. It follows then that the knothole circuit is a physical repre- 
sentation of the hypothetical structure analyzed in Section V. 

A straight-line knothole shift register requires two implant islands 
and a single, simple aperture for each circuit period. Note that a tab in 
the aperture is no longer necessary and has been removed. Hence, this 
circuit has been given the descriptive name knothole. The general 
layout and operation of three parallel shift registers is illustrated in 
Fig. 23. Bubbles in the center row propagate to the right, those in the 











Fig. 23—Bubbles move from rest positions at the ion-implanted islands in response 
to current flow downward. Each circuit period requires two implanted islands. 
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outside rows propagate to the left. A bipolar current applied to the 
sheet drives the bubbles. 

The performance of knothole circuits is sensitive to bubble size. To 
illustrate this point, refer to Fig. 24 where the data are presented for 
a 4-um period test circuit using a nominal 1.7-um bubble material. 
Islands were given an implant of 2.5 x 10’* Ne at 100 Kev, and no 
overall implant was included. A static in-plane field of 210 Oe was 
applied parallel to the current. Bias margin, frequency range, and drive 
power/bit (based on 0.1 ohms/cell) are outstanding. The direction of 
propagation is opposite, however, to the design direction. The oversize 
bubbles straddle pairs of implanted areas during operation. 

Design of an 8-um period closed-loop shift register is given in Fig. 25 
and data in Fig. 26. Here the nominal bubble diameter is 1.6 zm, which 
is a good match to the nominal 2 um by 4 um implant islands; hence, 
the direction of propagation is correct in the classical sense, i.e., 
clockwise around the loop. The islands were implanted at 3x10" with 
Ne at 50 Kev. Driven at J = 3 mA/pm, the continuous power dissi- 
pation is 58 pw/b. 

A variety of z-turns were tested during the course of this develop- 
ment, and the most successful of these is incorporated in the circuit of 
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Fig. 24—Operating margins vs frequency for a 4-~m period knothole circuit. 
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Fig. 25. The shape of the aperture in the turn was optimized from field 
plots on scale models and iterated until the desired drive gradients 
were seen. Operating margins of this turn improve when an in-plane 
field is applied parallel to the direction of the drive current. For the 
data of Fig. 26, the in-plane field was 100 Oe. 

Straight-line propagation parallel to the drive current was achieved 
with the circuit of Fig. 27. Operating margins are similar to those 
obtained for propagation normal to the current, and much of the same 
design criteria apply here. Although the drive currents per cell are 
lower, the resistance/cell is substantially higher; thus, the power/cell 
is essentially unchanged. 

A detailed description of support functions such as generation and 
detection is deferred until Section VII, which treats dual conductor 
circuits. 


Vil. DUAL CONDUCTOR OPERATION 


The knothole configuration with its single perforated sheet and 
“offset-force” zones does not have the design flexibility that can be 
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Fig. 25—Details of z-turns used in 7-step shift register loops. Clockwise os can be 
mirrored to make counterclockwise turns. 
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Fig. 26—Data for a 15-step knothole loop designed as in Fig. 25. Bubbles were 
propagated in 14-step bursts. 
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Fig. 27—Propagation parallel to the direction of general current flow. 


achieved with a pair of conductor sheets. For example, with the pair of 
conductor sheets it is possible to transfer a bubble, i.e., switch a bubble 
from one path to another, or reverse the direction of propagation, 
simply by altering the propagate pulse sequence. This section deals 
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with straight-line propagation parallel and perpendicular to the direc- 
tion of current flow, closed loop design, nucleate generation, detection, 
and transfer. Also included is a discussion of factors that influence chip 
design. The discussion is limited to the case where the currents in the 
two levels flow either parallel or anti-parallel to one another. It will be 
apparent to the reader, however, that, if the structures are reconfigured 
so that the currents flow perpendicular to one another, a new family 
of device structures somewhere between the rotating current structures 
of Section IV and those of this section could be realized. 

Dual conductor circuits are fabricated with a first patterned con- 
ductor level either directly on the epitaxial garnet or onto a pre-spacer 
layer, an insulating layer, and finally a second patterned conductor 
level. Details of the actual fabrication are covered in Section IX. It 
will be useful at this time to describe the general layout of the test 
structure depicted in Fig. 28. All the test data described in the following 
sections were taken on circuits located within the cross-hatched region 
which is 320 um wide and 600 pm long. The paths that lead into this 
region are shaped to make the current density within the region as 
uniform as possible. 

In line drawings equivalent to processed circuits, for a view looking 
down onto the second conductor layer, the following conventions will 
be established and used throughout (see Fig. 29). All apertures in the 
first conductor level (that nearest the garnet) are outlined by a solid 
line, whereas apertures in the second level are indicated by a dot 
pattern. Sheet currents 4,3 and Jo, flow in the first and second 
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Fig. 28—-Layout of the circuit used to test dual-conductor designs. Two or more probe 
fingers contact each pad area located along the bottom edge. Uniform current density 
is maintained at the test area. 
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conductors, respectively, where they generate local bias-field pertur- 
bations at positions 1, 3 and 2, 4. For a bubble-stabilizing field Ahias 
directed downward as shown, bubbles are attracted to location 1 for 
current J, location 2 for Jo, etc. 


7.1 Propagation normal to the current flow 


In Fig. 29, it can be seen that a bubble initially at position 1 
propagates upwards (normal to the direction of current flow) when the 
current pulse sequence [234] is applied. Sequence [321] returns the 
bubble to its starting position. Straight-line propagation over a further 
distance is achieved by simply linking like pairs of apertures as in Fig. 
30. The length of each aperture is 0.5, and the distance between 
successive like positions is the circuit period, A. It should also be 
apparent that the maximum tolerable misalignment of apertures in the 
vertical direction is 0.25A. 

Typical current-pulse waveshapes to produce the sequence 


—- APERTURE IN 
LEVEL 2 






4—_— —— —> 1, 3 
-- APERTURE IN 


LEVEL 1 
;,-——— + |, 2 


Fig. 29—A four-position, dual-conductor, bubble-stepping circuit. Bubbles are at- 
tracted to positions 1 through 4 when currents J, through J, are applied. 





(a) (b) (c) (d) (e) 


Fig. 30—A time sequence of the operation of a dual-conductor shift register. The 
direction of propagation will be reversed if the currents are applied in the sequence (e) 
through (a). 
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[123412341] for stepping a bubble through two periods are sketched in 
Fig. 31. Some overlap between successive pulses is helpful to further 
quantize the traveling-wave bias field minimum that drives the bubble. 
Also, to get a bubble fully advanced into position for, say, another two- 
step burst, then a final [41] overlap should be provided. 

Success in aperture design is measured in terms of generating the 
maximum possible field gradient for the lowest power dissipation. 
Almost any aperture will perturb an otherwise uniform current density 
and give rise to bubble motion. Some of those tried were shown in Fig. 
6, along with field plots obtained on scaled-up models (details in 
Section X). This field data information coupled with both processing 
and operational experience points to the rectangular shape 0.3A by 0.5 
as the most practical choice. 

The operating margins for a series of 8-wm period dual-conductor 
test circuits are presented in Figs. 32 to 38. Circuit data cover both 
unimplanted and implanted garnets as well as the influence of in-plane 
fields parallel to the drive currents. A frequency and temperature run 
are also included. Individual data points are shown wherever they aid 
in understanding the experiment. Bubbles used in these tests were 
obtained by subjecting a chip to a large in-plane field. Testing contin- 
ued until the bubble supply was depleted, and a new supply was 
initialized. 

We report, in Fig. 32, 1-mHz data on an implanted garnet test circuit 
driven with a two-step to and from sequence [1234123432143214]. Of 
special interest is the spread in data points, especially at low drive. We 
suspect that this is a result of the bubble states found in implanted 
gamets’” “‘ and to a greater extent in unimplanted garnets. The domain 
structure of the implanted layer is also suspected. 

The low bias margin is limited by stripout, as is usual for most of 
the dual-conductor circuits tested. This is especially significant when 
detection is considered. Free-bubble collapse limits the absolute upper 
bias, since “hold currents” are not maintained between pulse bursts. If 
an in-plane field is applied,’ much of the uncertainty in the data points 
is eliminated, as is evident in Fig. 33. 
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CURRENT PULSE SEQUENCE 


Fig. 31—The current pulse sequence used to step bubbles in a dual-conductor circuit. 
The overlap of pulses is used to further quantize the propagation field. 
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Fig. 32—Bias margin vs drive for an implanted garnet tested at a 1-MHz stepping 
rate in an 8-ym period dual-conductor circuit. Each data point represents an individual 
check of the upper bias margin. The spread in the test data indicates that the bubble 
state and/or the implant layer magnetization does not reproduce from test to test. 


When an unimplanted garnet is tested, the upper bias margin often 
shows two distinct distributions (Fig. 34). Failure modes associated 
with these curves indicate that two distinct bubble states, probably 
the S = 0 and S = 1, are involved. With a sufficiently intense in-plane 
field, the double distribution disappears. Refer to Fig. 35, where 
margins for both a 200-Oe and 300-Oe in-plane field are given. 

These experiments suggest that an in-plane field is helpful to stabi- 
lize bubble propagation in dual-conductor circuits. We do expect the 
shape of the field gradients to continually redirect bubbles toward the 
desired path. However, it is known that S # 0 bubbles do not move 
along the maximum field gradient, but rather at angles (often near 
‘ right angles) to the gradient. In Table I, the skew angle calculated for 
S = 1 bubbles are tabulated for the garnet compositions reported in 
this paper. Further insight into the problem comes from an analysis of 
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Fig. 33—-Margin data for the conditions of Fig. 32, except that an in-plane field from 


65 to 306 Oe was applied parallel to the current flow. The spread in data points was 
virtually eliminated. 


Table |—Comparison of material parameters 


Material q 4nM, AH: Bb y y* forrt 
Type (ratio) (gauss) (Oe) (cmS/s-Oe) (1/Oe-s) (degrees) (MHz) 
YSmLuCaGe 3.8 524 2.0 300 1.7 x 10’ 22.5 0.4 
garnet 
LaLuSmGa 4.0 478 1.5 500-1000 1.7 x 10’ 35-45 0.5-1.0 
garnet 





* Tan fy = 4uS/yd (S = 1). 
+ forr = 4H. /aA (A = 8 pm). 


the trajectory of an S #0 bubble moving in the dipole field generated 
at a circular hole. Details are presented in Section X. The choice of 
whether to cope with bubble states by one of the conventional hard 
bubble suppression methods or by an in-plane field will be dictated by 
the method chosen to detect bubbles. It may prove advantageous to 
shape the apertures, particularly in turns, to take advantage of pre- 
ferred propagation angles. 

In Fig. 36, the results are presented of a frequency run on an 
unimplanted garnet in a 200-Oe in-plane field. These results are 
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Fig. 34—Bias margins vs drive for an unimplanted garnet tested at 1 MHz in an 8-~m 
period dual-conductor circuit. Two distinct distributions in the upper bias margin were 
observed. 
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Fig. 35—The data of Fig. 34 repeated except with an in-plane field applied parallel to 
the current. With unimplanted garnets, substantially higher in-plane fields are needed 
to tighten the data points. 
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Fig. 36—Operating bias margin vs frequency for an unimplanted garnet film with a 
200-Oe in-plane field. 
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Fig. 37—It is wasteful of power to run current-access circuits except at a frequency 
forr at which the energy dissipated per step is a minimum. Calculated and experimental 
values of fopr are compared. 
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Fig. 38—Operating margins vs temperature taken at 30 kHz with a four-step propa- 
gation sequence. The garnet was unimplanted. 


consistent with predictions based on calculated drive gradients and 
measured garnet properties. We can use these results to introduce fopr, 
the optimum frequency at which to step a bubble from one point to 
another to minimize energy dissipation. It can be shown that fopr = 4 
pH./n\. The relative energy per step (proportional to J*/f) vs fre- 
quency calculated from the data of Fig. 36 is plotted in Fig. 37. The 
calculated fopr for wafer B-3869 (H,. = 0.5 Oe, . = 1,000 cm/sec-Oe) 
compares favorably to the frequency corresponding to the energy 
minimum of Fig. 37. 

Operating margins at 30 kHz for an unimplanted garnet over the 
temperature range —30°C to +80°C are given in Fig. 38. These data 
were furnished by J. L. Smith. The minimum drive increases at low 
temperature much as expected due to the increase in coercivity. Note 
especially the very wide margins at 80°C. 


7.2 Propagation around a closed loop 


Details of a nine-step, bi-directional, closed loop are illustrated in 
Fig. 39. Note that the slots in the turn are 0.75, in length and that 
elements that lead into the turn are repositioned somewhat. Bubbles 
propagate clockwise for the drive sequence [---12341234---] and 
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counterclockwise for [- - -43214321-.--]. Bias field-drive characteristics 
for a 6-um period loop operated at 1 mHz are given in Fig. 40. The 
margins are quite adequate, as is generally the case for loops at the 
center of the test area. If z-field cancellation is not used, bubbles in 
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Fig. 39—-A 9-step dual-conductor closed loop. The elliptical apertures in the turns 
have been lengthened for the reasons described in the text. 
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Fig. 40—Bias margin vs drive for a 6-um period, 7-step loop operated at 1 MHz. The 
garnet was unimplanted. 
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loops located near the outer edges of the test area are biased by both 
an alternating and a constant field. A typical failure mode (Fig. 39) 
finds bubbles in clockwise motion, jumping to 3d from 2b rather than 
stepping to 3b. Some modifications were necessary to cope with this 
problem. 

First, the two slots in the turn were lengthened further to 1.25, and, 
second, additional apertures were placed beyond the turn itself to help 
equalize the current distribution. The latter is readily accomplished 
by interconnecting loops end to end (Fig. 41). Note that the bubbles in 
adjacent loops move in paths counter to each other just as the gears 
of a gear train. 

Impetus for making the modifications given above came from test 
circuits in which interconnecting loops spanned the width of the test 
area. Loops with lengthened turn-elements outperformed all the others 
in positions 1 and 5, whereas all loops were satisfactory in positions 2, 
3, and 4. 

These experiments also gave additional evidence of the “gyro” 
nature of the bubbles in our materials. Consistently, the even-num- 
bered loops outperformed the odd-numbered loops for one direction of 
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Fig. 41—Loops can be interconnected end-to-end by the method shown. The propa- 
gation direction of adjacent loops alternates, i.e., cw, ccw. Performance is related to the 
propagation sense and bias field direction. 
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bias and vice versa. One additional observation not as yet understood 
was made: Operation was generally better if the bias field was directed 
toward rather than away from the garnet surface. 


7.3 Propagation parallel to current flow 


In this section, propagation parallel to the direction of general 
current flow is considered. This is not an absolutely essential operation 
for a chip layout, since such motion can also be made by combinations 
of motions normal to current flow and the z-turns discussed in the 
preceding section. In fact, our earliest shift registers were designed 
that way. The principle of operation, from Fig. 42a, is straightforward. 
A bubble starting at the leftmost position la proceeds through 
1a2a3a4a1234 in response to pulse sequence [123412341]. 

The actual operation is far from ideal, since propagation down the 
center track is stable only for a restricted (and oftentimes non-existent) 
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Fig. 42—Circuit for propagation parallel to the current. The original bidirectional 
design (a) was modified (b) to be unidirectional to improve performance. 
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set of drive and bias conditions. It is observed that a bubble in transit 
through 1a2a jumps to 3b rather than proceeding to 3a as desired. This 
occurs since the current density at 3b, and thus its pole strength, is 
substantially greater than at 3a, thereby counteracting 3b’s increased 
distance from 2a. 

Operation improves substantially if the apertures are increased in 
length and staggered as shown in Fig. 42b. This modification further 
removes the offending poles from the intended bubble track and also 
forces increased current along the bubble track. As a result, the circuit 
of Fig. 42b gives adequate margins for motion [- - -12341234. --], how- 
ever, motion along [- - -43214321---] is unreliable because of encoun- 
ters such as ld to 4d rather than 4a. 

Operating margins for an 8-um period shift register stepped at a 1- 
mHz rate are given in Fig. 43. The garnet was not implanted nor was 
an in-plane field applied. The performance is seen to be quite good. 


7.4 Bubble generation 


A nucleate bubble generator can either be patterned in one of the 
propagation conductor levels or it can be provided in a separate 
conductor level. Only the former design is considered here, since the 
design of the latter is obvious. Now it is possible to introduce slots in 
a sheet conductor if the slots are narrow (1/4A or less) and run 
essentially parallel to the current flow. Such slots can be used to 
concentrate current and thus nucleate bubbles, yet permit current flow 
during normal propagation. Just how this can be done is seen in 
Fig. 44. 


BIAS(Oe) 





1 2 3 
J(mA/pum) 


Fig. 43—Bias range vs drive at 8-um period and 1 MHz for circuit (b) of Fig. 42. The 
garnet was not implanted. 
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NUCLEATE | PROPAGATION 
GENERATOR DIRECTION 





\ SLOT #1 


Fig. 44—A nucleate generator can be patterned as part of a conducting level. Slots 
guide the generator current but do not interfere with normal propagation. Bubbles are 
nucleated at the tip of slot #2. 


The generator is incorporated into level 1 since that ensures the 
lowest possible generate current. Slots #2 and #4 funnel the generator 
current J, from the input pad into the generator proper. A continuation 
of slot #4 acting in conjunction with slots #1 and #3 forms relatively 
high resistance paths so that most of the incident current J, concen- 
trates at the tip of slot #2, nucleating a bubble, before diffusing as it 
enters the chip area below. 

Generator operation has been characterized in a nominally 8-um 
period shift register processed on wafer B3954. The drive conditions 
were: amplitude 1.5 mA/ym, width 1.5 us with 33 percent overlap. The 
generate pulse was applied in conjunction with propagate pulse 4. 
Subsequent propagate pulses move the newly generated bubble along 
the shift register track. Both adjacent and isolated domains were 
generated. The 200-ns wide generate pulse had an amplitude range of 
230 to 360 mA. 

The composite bias margins for generation at and propagation away 
from the generator were 19 Oe, with the lower end of the margin at 
stripout. For propagation through the generator, the margins reduce 
to 15 Oe with most of the margin loss at the low end. To ensure proper 
motion of bubbles as they pass upwards through the generator struc- 
ture, a current density J;,3 must be maintained between slots #3 and 
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#4, In our experiments, a single resistor diverted a fraction of the drive 
current J,,3 into the generator pad. 


7.5 Detection 


Field-access permalloy devices use a chevron expander detector in 
which a bubble domain is stretched laterally before detection takes 
place in a magnetoresistance sensor. This approach has the advantage 
that it permits unlimited expansion without any effect on the data rate 
of the chip. The exit portion of the detector can be designed either to 
shrink the strip domain back to a bubble or to discharge the strip 
domain into a guard rail. The detector we report is similar to that just 
described and to a detector used with the bubble lattice.® 

Bubble expansion and contraction is accomplished in the structure 
of Fig. 45. The analogy to a chevron expander structure is apparent. 
Factors relating J and H, for an array of parallel conductor strips were 
introduced in Section III and are detailed further in Section X. In 
typical geometries, a current density J = 4 mA/ym generates a bias 
field decrement of nearly 20 Oe. In other words, stripout in the detector 
will be sustained over a 20-Oe range of the bias field. Since all the 
dual-conductor functions operate down to the strip-to-bubble transi- 
tion field, we can expect to realize most of that bias field range. 

Design curves to estimate the in-plane field H, due to (i) the strip 
domain and (ii) the drive conductor strips are given in Figs. 46 and 47, 
respectively. At 8-~m period, assuming 47M, = 500G and J = 4mA/ 
pum, we expect H, = 100 Oe for a typical strip domain and a maximum 
field of 20 Oe (depends on detector-to-conductor strip positioning) 
from the drive conductors. The magnitude of these fields suggests that 
a thin-permalloy magnetoresistive sensor can be used. A photograph 
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Fig. 45—Details of a dual-level expander detector. Bubbles, entering from the top, 
are expanded for either magnetoresistive or inductive sensing at A- -A. 
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Fig. 46—Calculated in-plane field from a strip domain effective on a magnetoresistive 
sensor. 


of the conductor-expander section complete with a “Chinese charac- 
ter’ sensor is seen in Fig. 48. The sensor is processed on top of the pair 
of slotted conductor sheets. 

In current-access devices, there is the opportunity to design a chip 
without permalloy. For example, if a strip domain is driven by an RF 
component of the bias field, the variation in the strip’s flux can be 
sensed by a conductor pickup loop. This scheme is being pursued by 
J. M. Geary, and he has allowed us to present some of his preliminary 
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Fig. 47—Calculated in-plane field from the parallel drive strips of an expander 
detector. 


results. A single-turn pickup loop 20 pm in length was patterned in a 
conducting sublayer at “A.--A” of Fig. 45. An output signal of 5 nV 
was obtained when the width of a strip domain was modulated at 8 
MHz (Fig. 49). RF detection in magnetic devices is not new and, in 
particular, we refer to an excellent article by Benrud et al., which deals 
with RF detection in thin permalloy-film memories.’® 


7.6 Transfer 


Transfer is a particularly useful function since, combined with 
propagation, generation, and detection, it completes one set of func- 
tions that can be used to design a major-minor chip. This is significant, 
as it is well known that the major-minor organizations give improved 
performance over single loop shift registers. More important is that a 
dual-conductor major-minor chip can be partitioned into individually 
accessed regions with a resultant reduction in chip power dissipation. 

One obvious way to implement a transfer is to use a “transfer” 
conductor patterned as part of a third conductor level to gate bubbles 
from minor storage loops to a major track and back again in much the 
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Fig. 48—An sEM photograph of an expander detector with a thin permalloy magne- 
toresistive sensor. The circuit period is 8 um. 





Fig. 49—Strip domains, excited by an 8-MHz RF signal, are detected in a single-loop. 
The raw signal, approximately 5 pv in amplitude, amplified and demodulated is shown 
in the upper trace. The bias field was taken above bubble collapse for the middle trace. 
A composite of the drive current is shown in the lower trace. Horizontal scale is 
20 ps/div. 
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spirit of “field access” permalloy structures. However, perhaps one less 
processing level will be needed if the transfer can be accomplished 
within the framework of the propagate conductor levels themselves. 

Such a compatible transfer gate can be realized by nesting apertures 
within one another as per the upper turn of the closed loop in Fig. 50. 
When this arrangement is used, it is only necessary to modify the 
propagation pulse sequence to cause bubbles in the transfer position 
to move into exit paths, whereas bubbles not in the transfer position 
are only idled backward and forward and always in a manner consistent 
with the retention of data when normal propagation resumes. Return- 
ing to Fig. 50, bubbles in the nine-step loop propagate clockwise with 
the standard pulse sequence [---12341234---]. However, a bubble 
located at the transfer position “la” is transferred to position “4b” by 
the sequence [123412141234] in which the steps associated with the 






NORMAL 
CLOCKWISE [-°°12341234-°] 
PROPAGATION 


TRANSFER 
OUT [°°°123412141234>-::] 
SEQUENCE = a= 


BIAS(Oe)} 





J(mA/um) 


Fig. 50—Design and performance of a transfer port. In the transfer itself, the first 
level slot is nested within the slot of the second level. The propagation sequence is 
reprogrammed to transfer. 
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actual transfer have been underlined. It should be remembered that if 
data are to be preserved elsewhere, it is important to use only next 
number transitions in the transfer sequence. 

Margins of propagation and transfer taken at 400 kHz are also 
included in Fig. 50. It can be seen that this transfer has very good 
margins indeed. More recent data have shown that, when the pulse 
which moves a bubble from 1b to 4a (4 in the underlined sequence) is 
lengthened by 0.25 us, then an otherwise 1-MHz operation can be 
maintained. That is, only even-odd number transitions are permitted. 
ted. 


7.7 Dual-conductor chip design considerations 


There are two distinct aspects to the minimization of dissipation in 
dual-conductor devices. We discuss the minimization of power con- 
sumption on a local basis in Section 7.7.1 and on a chip basis in Section 
Voie 


7.7.1 Optimization of conductor thickness 


Power dissipation will always be a concern in current-access devices. 
For example, for J = 1.5 mA/pm, A = 8 pm and R = 0.1 Q/U, the 
continuous power dissipation is 14.4 ».W/b. Even at J = 1 mA/um and 
A = 4 pm, the power dissipation is 1.6 uW/b. In this section, we turn to 
the problem of finding the correct thickness for conductors 1 and 2 
under the constraint of an intervening insulator of thicknesses. From 
a practical standpoint, the insulator must withstand any voltage that 
exists between the two levels. The drive conductors are usually com- 
moned at one end so that the maximum voltage encountered is the 
sum of the voltage drops of the layers. For the chip design discussed 
in the next section, the maximum inter-conductor voltage is 7.5 V. 

For the purpose of calculating relative power consumption in a 2- 
conductor device, regard the fields produced by each level as those 
from an infinitesimally thin conducting sheet at the median plane of 
the actual conductor. For numerical work, the fields computed from 
eq. (25) will be used. When the two-conductor levels produce the same 
field, the relative power per unit area, normalized to the hole diameter 
2a, is 

P = 2a/th #7? + 2a/bH3, (10) 
where ¢, and ¢2 are the conductor thicknesses. The conductor of 
thickness ¢; is nearer the garnet. The two-conductor levels are sepa- 
rated by an insulator s units thick, as shown in Fig. 51. Values for P 


were computed at the maximum of ¥ near r = a for h/a = 0.8 subject 
to the height constraint 


at=h+8+ hr (11) 
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RELATIVE POWER P 


= 


T= (tz +5 4+to)/a 
s/a = 0.08 


LINEAR CURRENT DENSITY RATIO Jo/J, 


REDUCED THICKNESS 





0. 0.2 0.4 0.6 0.8 1.0 
REDUCED TOTAL THICKNESS + 


Fig. 51—Nomograph to determine the optimum thicknesses of levels 1 and 2 to 
minimize power dissipation in a dual-conductor circuit. Analysis based on a circular 
aperture of diameter 2a and garnet thickness h = a. 


For each of several + values, the minimum of eq. (10) was found. The 
results are shown in Fig. 51 together with the ratio of the current 
densities J2/J; for the two levels. This ratio follows from the require- 
ment of equal fields from the levels: J2/J; = #1/H#. 

The behavior of P as a function of 7 represents a trade-off between 
efficient field production—¥ large—and efficient power transmission 
in the conductor levels—t large. The best compromise between these 
conflicting requirements occurs for t ~ 1. The broad minimum of P 
about + ~ 1 permits a wide choice of + with only minor changes in 
power consumption. For example, a 50-percent reduction of 7 from 
0.96 to 0.48 causes only a 16-percent increase in P. For current 
experimental devices, typical dimensions are ¢; = 0.25 pm, s = 0.15 um, 
and ¢2 = 0.35 pm. Using a = 2 ym, one finds t = 0.38. Although a larger 
value does reduce power consumption, it also increases processing 
complexity. 
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7.7.2 Design of a major-minor chip with partitioning 


We now introduce partitioning of the conducting layers in a dual- 
conductor shift-register or major-minor chip design. In a partitioned 
chip, the main storage area and the major track are independently 
driven with a resultant reduction in chip dissipation. The shift-register 
and the major-minor chip layouts are very similar except that the 
shift-register uses parallel tracks rather than minor loops. Both of 
these designs allow for redundancy in the storage area. Given the 
functions of propagation, generation, annihilation, and transfer, the 
design of dual-conductor chips proceeds much as for field-access chips. 
There are, however, considerations peculiar to these current-accessed 
chip designs that we shall bring to your attention in this section. 

The design and performance characteristics of a hypothetical major- 
minor chip with an 8-ym period and 262,144-b capacity is outlined in 
Fig. 52. Similar designs have been proposed by J. L. Smith and P. I. 
Bonyhard. Bubbles are assumed to move in bursts at a 1-MHz stepping 
rate. The chip is partitioned into independently accessed storage and 
detector-major track zones. The resistance everywhere is assumed to 
be 0.1 Q/square. Each zone is driven at the current density indicated 
in the figure. If the entire chip was run continuously at 1 MHz, it 
would dissipate over 4 W. With partitioning, these zones can be 
independently accessed, reducing the chip dissipation substantially 
with no sacrifice in performance. In a partitioned chip it is, of course, 


CONDUCTOR ACCESS “‘MAJOR—MINOR” 


262,144 BITS 
1 MHz STEPPING RATE 
2.2 MSEC AVERAGE CYCLE TIME 
POWER AVERAGE 349 mW 
POWER WORST CASE 461 mW 
DETECTOR-—50 STEPS 


MAJOR TRACK 400um x tall 
4,096 STEPS \ 





{4mA/um) 









48mA Re fe ra a ES ea i, Pe eesti see ene ts mee aes 
(1.5mA/um) 






------ 1024L00PS - ---~----- 
256 STEPS/LOOP 
262,144 BITS 





1.54A 
(1.5mA/um) 






Fig. 52—Proposed design of a partitioned “major-minor’” dual-conductor chip. The 
chip is made rectangular to increase the drive impedance and partitioned to permit 
selective drive to reduce the on-chip power dissipation. 
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necessary to move bubbles between zones. Propagation across slots 
was discussed with generators in Section 7.4. 

It is desirable to make the impedance of the storage area compatible 
with driver circuitry. If the geometry of the storage area is laid out as 
a square, then it will present about 0.1 Q resistance to its drivers— 
completely independent of the storage capacity. However, if the con- 
ductor sheet is reconfigured as a rectangle, its resistance can be 
“transformed” into a better match for semi-conductor drivers. 

If the resulting “stick-like” chip is too awkward to handle, it can be 
folded, since the current (and bubbles) will have no difficulty negoti- 
ating the turns. A rectagular chip does have the advantage that it can 
dissipate heat more readily than a square chip and that it is well 
configured for a coilless package. 

The number of minor loops is determined by noting that 1024 loops 
are, in binary multiples, the most that can be driven from a 5 V supply. 
This storage area would dissipate 3.77 W if driven continuously, but 
its duty factor is very low (0.03) so the average power dissipation is 
just 110 mW. The detector and major track do run essentially contin- 
uously; however, they together represent a very small fraction of the 
chip area and, in total, dissipate only 239 mW. As a result, the total 
chip dissipation is but 349 mW. Further information on cycle time, 
duty factor, and power dissipation is summarized in Table II. If the 
chip capacity is quadrupled (1024 b/loop), the chip dissipation would 
only double and the cycle times would remain about the same. 


Vill. EPITAXIAL GARNET MATERIAL 


In this section, we examine the material requirements for current- 
access bubble devices. The specific material parameters of particular 
importance to this technology are discussed and a garnet system 
proposed which allows relatively high-domain wall mobility along with 
adequate uniaxial magnetic anisotropy. 


8.1 Growth and characterization 


All compositions grown for the current-access high-density bubble 
devices were grown on Gd;3Ga;Oi2 substrates by the LPE dipping 


Table II—Characteristics of partitioned 262-kbit dual-conductor chip 





Average Case Worst Case 
Transit ———___- 
Time Cycle Duty Power Cycle Duty Power 
Zone Steps (ms) Time Factor (mW) Time _ Factor (mW) 
Minor loops 256 0.256 0.128 0.03 110 0.256 0.06 230 


Major track 4146 4.146 4.146 0.97 239 4.146 0.94 231 
and detector 





4.274 1.00 349 4.402 1.00 461 
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technique using supercooled melts.'” Axial rotation was maintained 
constant at approximately 100 rpm during the growth period. The 
details of the apparatus and growth techniques have been discussed 
elsewhere.'*”! Growth temperatures were restricted to 875 + 60°C. 
Lattice parameter measurements were obtained using the HPM 
method.” Magnetic property measurements were obtained using the 
standard techniques.” 


8.2 Material properties 


Table III lists the parameters which must be considered when 
designing a material to be used in dual conductor-current access 
devices. Items 1 to 11 in Table III are important parameters to be 
considered when designing a bubble material for use in most bubble 
circuit technologies. Items 11 to 13 are of particular importance for 
dual conductor-current access devices. Because of the high-speed 
performance inherent in this technology, the mobility of the bubble 
material is of primary importance. Dynamic coercivity and bubble 
propagation angle are two other parameters that become increasingly 
important in small bubble devices. Low dynamic coercivities allow 
lower drive currents to be used for bubble propagation. A discussion of 
the importance of the bubble propagation angle is presented in Section 
7.3. An approach to increasing the mobility while maintaining adequate 
q values as well as an approach to lowering the dynamic coercivity is 
presented later in this section. 

Epitaxial garnet films of nominal composition Yj 2Smo,4Luo5- 
Cao 9Geo9Fes1012 have been shown to be useful in 16-um period field- 
access bubble devices.”® For 3-um diameter bubbles, this composition 
offers the advantages of domain wall mobility slightly in excess of 250 
cm s_' Oe™' along the q values of 5 and precise control of the temper- 
ature dependence of the magnetic properties by the appropriate ad- 
justment of melt chemistry. Figure 53 shows the material length 


Table IIl—Material parameters for 
current-access devices 


. Saturation magnetization, 47M, (Gauss) 
Exchange constant, A (ergs cm~’) 

. Uniaxial anisotropy, K. (ergs cm™) 

. Material length parameter, /(um) 

. Anisotropy field, Hx (Oe) 

. Quality factor, g (dimensionless) 

. Bubble collapse field, H.o1 (Oe) 

. Lattice parameter, L.P. (A) 

. Magnetostriction coefficient, Ari, Arvo 
10. Temperature dependence of Huu, (% °C™') 
11. Bubble mobility, u (cm/sec-Oe) 

12. Dynamic coercivity, AH. (Oe) 

13. Bubble propagation angle, ¥ (degrees) 


WCONHONARWN-H 
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Fig. 53—Material length parameter (1) vs strip width (sw) with the thickness-to-strip- 
width ratio as a parameter. 


parameter (/) as a function of domain strip width (sw) with the 
thickness-to-strip-width (h/sw) ratio as a parameter. Depending upon 
the h/sw ratio, the / parameter needed for a strip width of 1.7 um 
ranges from 0.115 to 0.235 um. For the case of h/sw = 1, the material 
length parameter must be 0.191 pm. 

Extending the (YSmLuCa)3(FeGe);Oi2 system to smaller bubbles 
results in the nominal composition Y10Smo5Luo07CaosGeosFes.2012 
which exhibits 1.7-~m diameter bubbles and the properties listed 
in Table IV. Materials grown from the’ general system 
(YSmLuCa)3(FeGe)sOi2 have exhibited dynamic coercivities ranging 
from 1.4 Oe to 3.2 Oe. The higher coercivities being measured are on 
the larger anisotropy materials, supporting smaller diameter bubbles. 
Because of the particular importance of bubble mobility in the current- 
access technology, a technique was needed to maintain a g value of 
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approximately 3 to 4 for materials exhibiting an / value of 0.16 to 0.19 
pum while also increasing the domain wall mobility. 

It would be advantageous to be able to decrease the samarium 
content of the composition listed in Table IV while maintaining the q 
value, which would result in an increased mobility. One approach to 
achieving this is to increase the concentration of the most “anisotropy 
active” constituents. Figure 54 shows the calculated uniaxial aniso- 
tropy as a function of the lutetium content of garnet compositions in 
the three systems (YLuSmCa)3(FeGe)sOi2, (LaLuSmCa)3(FeGe)s5O1, 
and (LaLuSm)3(FeGa);O;2. The samarium content and the Curie tem- 
perature are used as parameters. Curves A through C are calculated 
for Ca-Ge substitution of ~0.8 moles per garnet formula unit and a 
Curie temperature of ~480°K. Curves D and E represent gallium 
substitution leading to a Curie temperature of ~440°K. We point out 
that the curves shown in Fig. 54 do not take into account the lattice 
constant constraint usually imposed on epitaxial garnet films. The 
compositions required to achieve a lattice constant match with 
Gd3Ga;O12 (GGG) substrates (12.383A) are indicated by the point on 
each curve marked L.M. Comparison of curves A and B in Fig. 54 
reveals that, to decrease the samarium content from 0.5 to 0.3 in the 
garnet formula and maintain both the anisotropy of ~35000 ergs-cm™~* 
and a lattice parameter match with the ccc substrate, compositions 
outside the (YLuSmCa)3(FeGe);O12 system must be considered. Curve 
C indicates that the (LaLuSmCa)3(FeGe);O012 system allows both the 
uniaxial anisotropy of ~35000 ergs-cm™ and a lattice match to be 
achieved. A discussion of the advantages of lanthanum have been 
presented elsewhere.”’ 

It is well known that materials with high 47M, require relatively 
little diamagnetic ion substitution. The advantages realized in the 
divalent-tetravalent ion substitution system can be obtained with less 
complicated crystal compositions when high moment material is re- 
quired.”* For materials exhibiting a high 47M,, Ca®*-Ge** substitution 
could result in too high a Curie temperature making it difficult to 


Table I\V—Composition: 
Y1.09Mo.5LUo 7Cao sGeo aF es. 2012 


Lattice parameter 12.383 A 

4M, 510G 

Ku 36,600 ergs cm™? 

l 0.19 pm _ 

A 2.7 x 107°‘ ergs cm™! 
Hi, 1800 Oe 

q 3.5 

jt ~300 cm/S-Oe 


AH. (dynamic coercivity) 3.0 Oe 
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(YSmLuCa)3 (FeGe), 01 
(YSmLuCa)3 (FeGe), 042 
(LaSmLuCa)3 (FeGe), O19 
(LaSmLu)3 (FeGa)5 O42 
(LaSmLu)3 (FeGa)s O12 


UNIAXIAL ANISOTROPY (K,, x 10° ergs ~cm—3) 





0 0.3 0.6 0.9 1.2 1.5 1.8 2.1 2.4 2.7 
LUTETIUM CONTENT IN MOLES PER FORMULA UNIT 


g. 54—Calculated uniaxial magnetic anisotropy vs lutetium content for the systems 
(YSmiLuCa)s(FeGe)sOrn (LaSmLuCa)3(FeGe)50,2, and (LaLuSm)3(FeGa)sOi2. The sa- 
marium content and the Curie temperature are varied. The ccc lattice matching 
compositions are indicated by the symbol LM on each curve. 


control the temperature dependence of the magnetic properties to 
match the barium ferrite bias magnets. 

A slight imbalance of the divalent-tetravalent ion ratio could also 
result in increased coercivity. The immersion of a gallium garnet 
substrate into a melt supersaturated with an iron garnet containing 
Ca”*-Ge** might be expected to show increased substrate dissolution 
prior to initial film growth as compared to the same substrate im- 
mersed in a melt supersaturated with a gallium containing iron garnet. 
Considering the complications in melt chemistry and the careful con- 
trol of growth conditions necessary to minimize compositional fluctua- 
tions in Ca’*-Ge** substituted garnets, a move to simpler film com- 
positions would be advantageous. 

Curve D _ illustrates the anisotropy calculated for the 
(LaLuSm)3(FeGa)5Oi2 system. Note that the lattice match composition 
occurs at a lutetium concentration of ~2.15 moles per garnet for- 
mula unit and that a uniaxial anisotropy of ~35000 ergs-cm™* can 
be obtained. Table V _ lists a nominal composition in the 
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(LaLuSm)3(FeGa)sO.2 system along with some measured magnetic 
properties. Curve E shows that lowering the samarium concentration 
to 0.13 moles per garnet formula unit results in a significantly lower 
anisotropy for the Gac lattice matching composition. 


8.3 Dynamic coercivity 


The dynamic coercivity of epitaxial bubble garnet films supporting 
~1.7-um diameter bubbles has been measured as a function of film 
thickness.”” Figure 55 shows the dynamic coercivity measured by a 
bubble translation technique as a function of inverse film thickness. 
All curves are for material exhibiting 1.7-um diameter bubbles. Curve 
C is for a sample in the (YLuSmCa)3(FeGe);O12 system, while curves 
A and B represent two compositions in the (LaLuSm)3(FeGa);O12 
system. Curve A is for a samarium concentration of ~0.13 moles per 
garnet formula unit, while curve B represents data taken on a sample 
containing a samarium concentration of 0.3 moles. Note that the 
gallium material exhibits significantly lower dynamic coercivities than 
does the Ca”*-Ge** substituted material. 


8.4 Melt chemistry 


The phase equilibria observed in the (YSmLuCa)3(FeGe)sO12 system 
have been presented previously.’ Lanthanum does not form an iron 
garnet, and lutetium iron garnet has been reported being grown only 
once in bulk form,” possibly because of phase equilibria difficulties. 
As the concentration of lanthanum increases in the pseudo-ternary 
Flux—Fe.0;—)Ln.Os, the garnet phase field would be expected to 
narrow considerably. Figure 56 shows a section of the pseudo-ternary 
Flux—Fe20;—>Ln203, where )'Ln2O3 corresponds to the sum of 
La2O3+Lu203. The ratio of La,203;/Lu203 was fixed at 0.332 to allow 
stress-free epitaxial film growth on ccc substrates. Note that all data 
shown in the pseudo-ternary are for unsubstituted iron garnets. Sub- 
stitution of diamagnetic ions could be expected to shift the location of 
the phase boundaries. Addition of gallium results in the garnet-pe- 
rovskite boundary being displaced towards the Flux—Fe203 binary. 
Depending on the overall garnet oxide concentration in the melt, the 


Table V—Composition: 
Lap glues 1+SMo.3Gao9Fe4.1012 


4nM, 466G 

K, 35,500 ergs cm? 

A 2.0 x 10‘ ergs cm™ 
l 0.18 pm 

Hr 1915 Oe 

Q 4.1 

Be 500-1000 cm/S-Oe 
AH, (dynamic coercivity) ~2 Oe 
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Fig. 55—Bubble dynamic coercivity (AH.) as a function of inverse film thickness for 
1.7-um bubble material. Curve C is for (YLuSmCa);(FeGe);Oi2. Curves A and B 
represent two compositions in the system (LaLuSm);(FeGa)sOi2. Curve A is for a 
samarium concentration of ~0.13 moles per garnet formula unit. Curve B represents 
data on a sample containing 0.3 moles of samarium. 


range of Fe203/)'Ln2O3 over which garnet is found to be the primary 
phase field is from 6 to 9. For comparison, the Fe203/Y203 ratio range 
for garnet in the Flux—Fe20;—Y203; pseudo-ternary is from 12 to 40. 

Because of the large size of La®*, one would expect a relatively low 
value for the La®* distribution coefficient. The definition for the 
distribution coefficient of lanthanum is: 


( La 
Laan 
pia = crystal 
La 
(rotsisn) 
where the symbols in the fractions represent the number of moles 
present. Although the value for the effective coefficient will be rate- 


dependent, the value for k'* under average growth conditions is found 
to be ~0.42. 


1506 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1979 


The use of compositions in the (LaLuSm)3(FeGa);O12 system allows 
less complicated crystal compositions to be used. Lower dynamic 
coercivities and increased domain wall mobility can be obtained. 


IX. PROCESSING 


There are two major concerns in processing dual conductor struc- 
tures. First, the two conductors should both be as close as possible to 
the epitaxial film supporting the bubbles. Hence, a prespacer (analo- 
gous to the prespacer used between conductor and epitaxial layer in 
standard field access devices) should be as thin as possible or com- 
pletely omitted. In addition, it is immediately obvious that the dielec- 
tric layer between the conductors must be as thin as the limits of 
dielectric breakdown permit. Second, the very small period devices 
possible within the material constraints of this technology will require 
increasingly precise control for etching the fine-aperture metal film 
pattern. What is required is an etching method that will produce metal 
patterns dimensionally identical to the etch mask. 


9.1 Material requirements 
9.1.1 Insulating films 


If a prespacer is used, the material and/or process selection is quite 
unrestrained as the surface is completely planar and the layer thickness 


PbO LEGEND: 
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Fig. 56—A section of the pseudoternary Flux—Fe20;—2Ln20s, where 2Ln2O3 rep- 
resents La,O3 + LueQOs3. 
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required is very small. Thus, several avenues can probably be used: RF 
diode-sputtered SiOz or SiN; RF magnetron or S-gun deposited SiO, or 
SiN; plasma-deposited or cvD-deposited SiOz or SIN. In all the work 
reported, prespacers were SiOz deposited by RF plasma deposition 
using an Applied Materials Plasma I radial flow system. 

The insulating layer between conductors has more stringent require- 
ments due to the multiplicity of crossovers and the need for minimum 
thickness. Thus, very good dielectric properties are required and a 
process that deposits a conformal coating is mandatory, 1.e., equivalent 
deposition rates on vertical and horizontal surfaces are required (Fig. 
57). Higher-pressure processes such as plasma-deposition or CVD are 
clearly preferred over line-of-sight deposition methods such as evapo- 
ration or sputtering. A relatively low-temperature (c.f. 250 to 280°C) 
plasma SiO, deposition process has been developed using the Applied 
Materials Plasma I apparatus.®’ Dual conductor circuits fabricated 
using the plasma-SiQO>2 insulator exhibit breakdown potentials equiva- 
lent to those expected for high-quality, thermally grown SiO, films of 
an equivalent thickness. We thus conclude that the step coverage over 
the dual conductor features is optimal. 


9.1.2 Conductor materials 


Though any highly conductive metal alloy system may be used, Al 
or Au alloys appear quite suitable, and most of the work reported has 
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Fig. 57—Effect of conformal vs line-of-sight deposition on insulating film thickness at 
crossovers. 
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used Al alloys with low copper content (0.7 to 4 percent). The AlCu 
alloys are particularly suited because the relatively high current den- 
sity required for some functions in the dual conductor circuits (>10° 
A/cm’) indicates the use of conductor materials with low electromi- 
gration rates. The AlCu alloys have been evaporated using a standard 
e-gun source and planetary wafer fixturing to maximize step coverage. 
It is advantageous to carry out this deposition at as low a substrate 
temperature as possible to prevent stress buildup in the AlCu films.” 


9.2 Metal patterning 
9.2.1 Photolithography 


Standard contact-print photolithography was used on devices re- 
ported here.” We anticipate that other high-resolution methods such 
as projection printing, X-ray exposure, etc. may be used equally well. 


9.2.2 Metal patterning 


In standard rotating magnetic field-access bubble devices, the Al 
conductor layer is relatively simple, and at least for 16-wm period 
circuits can be etched with standard wet chemical methods. In dual 
conductor-accessed circuits, however, wet-etch processes are unusable 
in small period circuits due to the undercutting of the photoresist 
pattern by the isotropic etch process. This undercutting severely limits 
the dimensions achievable in conductor patterning. In fact, since these 
circuits will invariably require that near state-of-the-art resolution be 
reproduced in the resist mask, no isotropic process or process which 
biases the conductor line or space dimensions with respect to the 
photoresist pattern should be used. In the following section, we de- 
scribe a plasma-etching process that does indeed exactly reproduce the 
etch mask in the Al film. It is conceivable that other methods may be 
used, such as the planar anodization method described by D. K. Rose.™ 

For an etch patterning process, very high material etch selectivity 
will be required. If no prespacer is used, the first-level etch process 
cannot be allowed to cut into the epitaxial garnet film and thin it 
selectively. Similar constraints operate when etching the second con- 
ductor, since any thinning of the underlying insulator layer will ob- 
viously reduce the dielectric breakdown strength between the two 
conductors. Plasma etching is an obvious choice to satisfy the require- 
ments for etching Al alloy films for high-density dual conductor circuits 
for the following reasons: (1) plasma etching uses a chemical process to 
remove metal and very high etch selectivity can be achieved for 
different materials, (71) careful selection and control of plasma process 
parameters can replicate photoresist patterns in aluminum films with- 
out measurable (<0.1 zm) dimensional variation. (A good survey of 
plasma etch processes can be found in Refs. 35 and 36.) 
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The particular RF (13.56-MHz) plasma process adapted is based on 
one developed by D. Wang, et al.*” The apparatus used was designed 
by C. J. Mogab and F. B. Alexander,® and the etch chamber is shown 
schematically in Fig. 58. The basic etch process involves the following 
reaction: 


Al(s) + 3Cl(plasma) — AICIs(solid) 
followed by volatilization of the AlCls3 solid 
2AlCl;(solid) — (AICI3)2(gas). 


There is evidence that the anisotropic etch process is promoted by the 
impact of charged particles on the horizontal surface (Fig. 59). 
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ETCH GAS “ L GAS 


Fig. 58—Radial flow plasma etch apparatus. 
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Fig. 59—Schematic of anisotropic etch process. 
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Obviously, a bias-free etching process will produce near-vertical 
walls on the metal patterns, which might pose problems in the cross- 
over features in the dual conductor circuits. To date, the combination 
of conformal plasma SiO: and good step coverage in the Al evaporation 
process have minimized the potential problems, both with respect to 
dielectric breakdown and also propagate drive margin reduction, which 
might be caused by high-resistance crossovers. Figures 60 to 63 are 
SEM photographs of plasma-etched, dual-conductor patterns. Figure 60 
shows a horizontal, straight-line propagate path, while Figure 61 shows 
several vertical loops. Figure 62 shows one of the generator designs 
tested. Figure 63 shows, at the same magnification, three data storage 
loops designed and fabricated within identical linewidth and alignment 
tolerances. The three loops are of 8-, 6-, and 4-jm period. 


X. MAGNETIC FIELDS AND BUBBLE MOTION 


Early conductor-drive devices**° confined currents to discrete con- 


ductor lines narrower than a bubble diameter. The present approach 
to conductor propagation distributes current over a conducting sheet 
with openings which provide the necessary spatial variations of mag- 
netic field. To furnish some insights and a design aid for conductor- 
access propagation, two simple geometries—parallel conducting stripes 





Fig. 60—An sem of an 8-um period dual-conductor propagate pattern. The darker 
apertures are in the first conductor and the lighter in the second conductor pattern. 
These apertures were etched in the AlCu alloy by a plasma etch technique using a Cl- 
containing gas (2860X). 
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Fig. 62—An SEM of an experimental generator pattern plasma etched in AlCu (3160X). 
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les (2870X). 
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Fig. 63—Data storage loops plasma etched in AlCu films. The three patterns follow identical dimens 


(a) 8-~m period 
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and a circular hole in an infinite conducting sheet—were investigated. 
Bubble motion in a gradient field was also studied. 


10.1 Stripe conductors 


Consider an infinitesimally thin conducting sheet in the plane z = 0. 
From this sheet, infinite in the y direction, remove material to form 
conducting stripes of width 2a and spacing s > 2a; see Fig. 64. Each 
stripe carries a current J = —2a-J, in the y direction, where -J, is the 
constant linear current density of each stripe. In MKS units, the 
contribution*’ 


H.(n) = (Jo/27) In[ro(n)/ri(n)] (12) 


of the nth stripe to the z component of the field intensity may be 
expressed in dimensionless form by the quantity 


4rH.(n) (€+a—no)’? + ° 
KH,  — eet ed ] er 

(n) Jo nf —a—no)’+ &]’ 
where the lengths r;(7) and ro(n) are defined in Fig. 64, § = x/h, {= 
z/h, « = a/h, and o = s/h. For bubble propagation, one is interested 
in the average intensity 


(13) 


25th 
Hn) = h” i H.(n) dz, (14) 


where z; is the garnet-conductor separation and hf the garnet-film 
thickness. After integrating eq. (14) and summing over the stripes, one 
finds 


Hs, =F Y (-1) [fn A,; + 28,(n)Arctan C,,], (15) 


J=1 


z 


P(x,z) 





eae 


n=—1 n=0 n=+1 
Fig. 64—Coordinate system for an array of infinite conducting stripes. The conductors, 


2a units wide and spaced s units apart, occupy the plane z = 0. Each stripe extends to 
infinity in the + y directions, carries a current J, and bears an index n. 
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where 
H, = 4rH./J, = dimensionless average intensity, 


H, = ¥. H,(n) = average intensity, 


h=2/h, &=&4+1, Bln) =é+ (-lYa-n0, 
An = [G + Bn) 1/1 + Bin), 
Cry = Bln) /[Bi(n) + $18]. 


When the number of stripes is small, say, less than 100, the best way 
to evaluate eq. (15) is by programming the right-hand side as it stands. 
As the number of stripes n, is increased, this procedure becomes less 
attractive because 2n, evaluations of the logarithm and arctangent 
functions are needed for each value of x. Note that, for n; < », H, is 
not periodic; for a complete numerical description of eq. (15) in the 
vicinity of the stripes, the number of function evaluations is 2n,n,n?2, 
where n, is the number of points per period s and n, the number of a 
values of interest. 

When n, is large, say, n; > 100, the behavior of #, near the array 
center may be approximated by an infinite number of stripes. This 
choice permits a significant reduction in computation time but fails to 
give information about the behavior near the array edges—a matter 
that will be investigated separately. As n, — ©, the dimensionless 
average intensity, regarded as a function of &, has the properties 


HE) =A (E +0) and #,(-€) = H.(+46). (16) 


Consequently, one can restrict evaluation of #, to the points 0 <é< 
o/2. Equation (16) implies #,(0) = #,(0/2) = 0. This result may also 
be deduced from symmetry considerations. For an origin at the center 
of an opening between stripes, the same symmetry arguments require 
KH ,to be antisymmetric about this origin. For numerical evaluation of 
KH, in the limit ns > ~, eq. (15) was rewritten as 


A =H, + H2, (17) 


where 


2 N 
#1 =), (-1/6] Indy + » [InAnjA-n,] 
N 
- (af/o)| 5 =) | 
— [Bag(e? + a? — agy/otl| | | 
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2 +N 


Hr = 23) YY [(-1)/B(n)Arctan Cyj] 


J=1 n=-N 
N 
— [16aé(S2 — D/o4] sey = X | 


¢(k) = Riemann zeta function.” 


Equation (17) adds the contributions from the 2N + 1 stripes nearest 
the origin as they stand in eq. (15) to the contribution from the 
remaining stripes approximated by a truncated series in the inverse 
powers of the stripe index n. Since the leading term in the power series 








0 
0 0.05 0.10 0.15 0.20 0.25 


x/s 


Fig. 65—Dimensionless average value % of the z component of magnetic field as a 
function of x/s. The curves are graphs of eq. (17) for s = 4a and z, = 0. The ordinate #, 
is numerically equal to the field in oersteds when the linear current density of each 
stripe is 1 mA/pm. 
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Fig. 66—Same as Fig. 65, except x, = 0.1a. 


is n~*, the field at the origin due to a pair of stripes, remote from and 
symmetric about the origin, decreases as the inverse square of the 
distance of either stripe from the origin. Accuracy requirements deter- 
mine N. For the choice N = 10, used in the computations presented 
here, the estimated error in #, is less than 10~°. Figures 65 to 69 show 
graphs of #, vs x/s for parameter values useful to the device designer. 
Curves of H for z; = 0 are unique in that each has a cusp at x/s = 
0.25, where the slope d#./dx is not continuous. The choice s = 4a 
requires calculation only for 0 < é < o/4 because #, is symmetric 
about = +0/4. The origin of this symmetry may be understood by 
superimposing another current sheet with current density J = (J./2)7 
on the existing stripe pattern of Fig. 64. This addition leaves the z 
component of field unaltered. It creates, however, a system of adjacent 
conducting stripes of width 2a, carrying the current J = J,a alternately 
in the positive and negative y direction. Such an array of stripes enjoys 
symmetry about the points of discontinuity. Contributions to the z 
component of field from any pair of stripes symmetrically disposed 
about a selected discontinuity will add; #, will, therefore, have extrema 
at points of current-density discontinuity. To illustrate how the aver- 
age field intensity decreases with increasing distance from the conduc- 
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Fig. 67—Same as Fig. 65, except z, = 0.2a. 


tor array, values of #, were computed for several stand-off distances 
z;. Since #, assumes its maximum value at x/s = 0.25 for s/a = 4, the 
stripe edge was chosen for the calculations. See Fig. 70. 

To convert to cgs units, write H, = #,J, Oe and express Jp in 
A/mm or mA/ym. In other words, the dimensionless quantity #, gives 
the average field intensity in oersteds when the linear current density 
Jo is 1 mA/pm. 

The gradient of the average field dH./ax can be obtained with only 
a slight increase in computing effort because the needed function 
values are available. The dimensionless gradient is 








ss 4rh\ dH, 
= J 
G ( 7, ae =2 zy (—1)’Arctan C,,;. (18) 


When the number of stripes 7, is large, one may use the approximation 


+N e 
G=2 », Y [(—1)’Arctan C,,] — (8a/04) (2) - yn] 


_ N j= 1 


— (8a/0*) [3 + a? — & — 4 — §] sw) -y » n a (19) 


an expression correct to the 4th order in the quantity 1/n. Equation 
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(19) adds the contribution from the 2N + 1 stripes nearest the origin 
to the contribution, approximated by a truncated series in powers of 
1/n, of the infinitely many remaining stripes. Accuracy requirements 
determine the value of N. 


10.2 Infinite conducting sheet with circular hole 


Consider again an infinitely thin conducting sheet in the plane z = 
0. Let the sheet contain a circular hole, extend to infinity, and support 


al 
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Fig. 68—Same as Fig. 65, except z, = 0.3a. 
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Fig. 69—Same as Fig. 65, except 2, = 0.4a. 


the constant current density J = J,i at points infinitely removed from 
the hole. The origin of coordinates is chosen at the symmetry point of 
the geometry as shown in Fig. 71. The problem to be solved—a means 
of obtaining the z component of the magnetic field at the point P(r, 
§)—falls into two parts: (i) solution of a boundary-value problem to 
yield the linear current density J in the plane z = 0, and (iz) integration 
over the source distribution to obtain the field. 

The first of these is discussed by Milne-Thompson® in connection 
with two-dimensional fluid flow around a cylindrical obstacle. The 
current density is 


J = J.[1 — (a/p)*cos 26]i — Jo(a/p)’sin 29, (20) 
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where 
J. = constant linear current density at infinity, 
a = hole radius, 
= radial source coordinate, 
= angular source coordinate. 


The reader may quickly verify that eq. (20) satisfies the requisite 
boundary conditions. The current density is tangential at p = a and 
J — Ji as p > «. One can now obtain the magnetic field in MKs units 
from 





s/a=4 
2 x/s=0.25 


Ns=00 





0 1 2 
z,/a 
Fig. 70—Dimensionless average value #, of the z component of magnetic field as a 
function normalized stand-off distance z,/a. The curves are graphs of eq. (17) for s = 4a 


and x = 0.25s. The ordinate #, is numerically equal to the field in oersteds when the 
linear current density of each stripe is 1 mA/pm. 
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Fig. 71—Current sheet with circular hole. The dimensionless z component of the 
magnetic field is shown as a function of the reduced distance » = r/a for several values 
of vy = z/a. The curves are graphs of # defined by the integral below eq. (23). The 
ordinate # is numerically equal to the field in oersteds when 6 = 90° and the linear 
current density approaches 1 mA/pm as r—> ©, 





JxR 
47H = | re dé dn, (21) 


where 
R=(x-Si+(y—nj+zk and R=|RI. 


The surface integral, with respect to the source coordinates é and y, 
extends over the entire € — 7 plane apart from the points p = 
(¢ + 7°)’ < a, where the current density vanishes. After projecting 
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eq. (21) on the z-axis and expressing all components in cylindrical 
coordinates, one obtains a double integral of the form 


27 00 
4H. = J, | de | D(r, 0, 2, p, 6) dp. (22) 
0 a 


All radial integrals are tabulated; see, for example, Gradshteyn and 
Ryzhik.** The remaining angular integration, however, must be done 
numerically. After appropriate changes of the variables and collecting 
terms, eq. (22) reduces to 


4nH, = J,H sin 8, (23) 


where 
w= 2 | (FP, + Fy. + F3) dy = XH (p, v), 
0 


R= Co + CC + CC? + C3C? + C.C* 
(a2 — w’C*)(o? + 1 — 2uc)'”? ’ 
Fy = Cin{(o? + 1 -2uC)'”? + 1— nC], 
o(o? + 1 — 2uC)' + 0? — ad 


F3 = p(2C? — Dota] 
o— pC 


= r/a, vy = 2/4, 


o = (p+ v)'””, C 


cos ¥, 
Co = po’(1 + 0’), 

Cy = (o* + p’)(1 — 0°), 

C2 = —p[o7(o? + 3) + 27], 

C3 = 2u?(o? — 1), 

C, = 4p’. 


Numerical integration over the angular variable of eq. (22) employed 
the automatic numerical quadrature routine QUAD.*” The dependence 
of # on the radius and altitude is shown in Fig. 71. Using this figure 
and eq. (23), one may now estimate H, in MKS units. In the hybrid 
units described earlier, the z component of magnetic field in oersteds 
is numerically equal to # when J, = 1 mA/um and 6 = 7/2. 

To obtain information about the average value, 


z,th 5 2th 
s “ g s 
oe | a | Hp, v) dz, (24) 





A Arh 
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the dimensionless quantity 


Zyth vyty 
P=H#Hiy =k | #Hdz=y~ | H (u,v) dv (25) 


Zs 


was computed. Here pv, = z;/a and y = h/a. Since several y values are 
of interest, one may take advantage of the linear property of integration 
to reduce the number of function evaluations. Index the y values of 
interest with 7 such that y,;+1 > y;. To obtain a uniform formula and 
accommodate Fortran requirements, one must choose jy: = 0 and the 
smallest value of interest y2. Then, with the notation H; = H(u, yj+1), 
one finds 
= 1 J 
A, jf Ei» (26) 


Vj+1 i= 


where 


Ust+YVi+1 
gi = | H (p, v) dv. 


st Yi 


Each g; was evaluated with a 3-point Gaussian rule; for the y; used in 
the computation the estimated error is of order 10~*. The radial slope 
S= dH /dn was also computed using the approximation 


f (ue, Yj) = [# (Ue+1, Yi) — KH (ux-1, /)]/2Au, (27) 


where Au is the constant difference between adjacent values of p. The 
dependence of —# and —¥ on r/a is shown in Figs. 72 and 73, 
respectively. Contour maps of constant H. demonstrate the overall 
behavior of the field. Figure 7 shows contours of H, = constant for the 
reduced stand-off distances z,/a = 0.06 and 0.3. For both, h/a = 0.8. 


10.3 Linear chain of oval holes 


Current experimental devices use quasi-elliptic openings rather than 
circular holes; see, for example, Fig. 29. The work of Richmond“ with 
electric fields about a linear grating permits more realistic magnetic 
field calculations. Consider an infinite chain of identical oval holes on 
the y-axis, spaced 2b units apart, each with semiaxes a2 > a, parallel 
to the y and x axes, respectively. See Fig. 74 for the oval at the origin. 
Because of symmetry and the periodicity 2b in the y direction, La- 
place’s equation need be solved only in the region of Fig. 74. Rich- 
mond” gives the conformal transformation 


z = (ai/c)w + (a2/y)cosh ‘(cos y cosh w) (28) 


1524 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1979 





t/a 


Fig. 72—Dimensionless average value # of z component of magnetic field as a 
function of r/a for an infinite current sheet with a circular hole. The curves are graphs 
of eq. (25) for h = 0.8a, where a is the hole radius. The ordinate # is numerically equal 
to the field in oersteds when 6 = 90° and the linear current density approaches 
1 mA/pm as r— , 


from which one finds the complex linear current density 





y= 2bJ. | dw . _ azycJ,(cos*y cosh?w — 1)!” ‘ 
a | dz axy(cos*y cosh*w — 1)’ + ¢ cos y sinh w od 


outside the oval. Here -J,, a real constant, is the linear current density 
as X = > ©, a = 1/2, a2 = 2b/Ta2, z = x + Ly, Ww = Uu + Iv, and * 
means complex conjugate. Inside the oval, J = 0. The real parameters 
c and y satisfy the nonlinear equations cos y cosh c = 1 and c(ya2 — 1) 
= ay. Elimination of c from the equations leads to an efficient 
algorithm, based on Newton’s method, for machine computation of y 
for practical values of a; < 1 and a2. The iteration scheme 
oh 
fi 


. vigilaryi + gi(1 — azyi)|cos y; (30) 


Vit+1 ae Vi L any? He £7C0S Vi ’ 
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Fig. 73—Dimensionless slope 7 = d#/du of the z component of magnetic field as a 
function of r/a for an infinite current sheet with a circular hole. The curves are graphs 
of eq. (27) for h = 0.8a, where a is the hole radius. 


where 
f=ag'+y"— a, 
g = In{(1 + sin y)/cos y], 
f = df/dy, 
fi= fyi, 
fi = df/dy evaluated at y = yi, 


may be started with y,. = 1/az. If the iteration process ends with y;, the 
parameter c follows from c = g; = In[{(1 + sin y,)/cos y;]. 

By setting u or v constant in eq. (28); one obtains equipotential or 
flow lines, respectively. Several flow lines, indexed with the angle 
variable ¥ = 180v/z, are shown in Fig. 74. The oval satisfies the 
parametric equations 


x/a2 = (a:/c)cosh~'(q/cos y) (31a) 
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x/ay 


Fig. 74—Current flow near an infinite chain of oval holes. Identical oval holes with 
semiaxes a2 = 2a; and spacing 2b = 4ae occupy the y-axis. The flow lines, indexed with 
~ = 180v/z, were obtained from eq. (28) with v = constant. Note the almost elliptical 
shape of the oval for the parameter ratios a;/az = 0.5 and b/az = 2 used here. 


and 
y/a2 = (1/y)cos~"q, (31b) 


where 0 < cos y $ q < 1. Richmond“ discusses the departure from 
ellipticity. For geometries of practical interest, say, ai/az ~ 0.5 and 
b/a2 ~ 2, the oval may be regarded as an ellipse. The distribution of 
current along the line of minimum separation between ovals satisfies 
eq. (29) with u = 0. It was computed from 


J azyc(1 — cosy cos’v)'”” 


yA O<xuxa7/2 = (32) 


J,  ayy(1 — cos’y cos*v)'”” + ¢ cos y sin v’ 


and is shown in Fig. 75. The coordinate value associated with v follows 
from eq. (28). 

To complete the field computations, an efficient method for the 
numerical double integration of the components of eq. (21) must be 
constructed. This work is in progress. 


10.4 Bubble motion 


Experimental evidence, Fig. 34, for example, strongly suggests bub- 
bles with different hardness parameters. A study of bubble motion will 
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Fig. 75—Distribution of linear current density along the line of minimum separation 
between ovals of an infinite chain. The current density is normalized to Jo, its value 
infinitely far from the ovals. The graphs were obtained from eq. (32). 


aid the device designer by prediction of trajectories and transit times. 
With the assumption that the bubble speed,*’ 


V = p[do|VH-| — 8H./7]= 0, (33) 


is independent of the hardness or S number, the equations of motion 
in rectangular coordinates are 


x; = —VTy,e;, (34) 


where V is a scalar point function, p, is the bubble mobility equal to 
one-half the wall mobility, d, is the bubble diameter, H, is the z 
component of field averaged over the bubble height, His the coercivity 
field, Ti = T22 = cos $, Tiz = —Tx = sin ¢, é = VH./|VH-|, e; is the 
projection of é on the jth coordinate axis, and ¢ is the angle between 
—é and the local velocity V. The angle ¢ depends on S according to 
sin ¢ = 8SV/yd3|VH-| and is regarded as a fixed parameter in eq. 
(34). Here y is the gyromagnetic ratio of the material.** See also Fig. 
76. When applied to bubble motion due to fields from a hole of radius 
a in a current sheet, eqs. (33) and (34) assume the form 
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& = —(Fe/p*)[ibo(u? — H)coso + (uP + &H)sin ¢)], (35a) 
bo = +(Fe/p’) [Edu — #)sin 6 — (uP +HH)cos $], (35b) 
where 
F. = 1 — 8H./ads|V Hz |, 
| VH.| = (Jo/a) LF sin? + (#/p)* cos’6]'”, 


sin 0 = £2/p, 
cos 6 = £1/p, 
w= (E+ &), 
£ = x/a, 
£2 = y/a, 
S = dH / dp. 


Hybrid units were used for eq. (35); this allows substitution for H. in 
oersteds. The current density J,, however, must be in units of mA/yum. 
The dot over the variables on the left-hand side of eq. (35) means 
differentiation with respect to the normalized time T = t/t, where + 
= a’/us ds Jo. Solutions to these coupled, nonlinear differential equa- 
tions must, in general, be obtained by numerical techniques. Some 
special cases, amenable to an analytical approach, are now described. 

The equations of motion, eq. (34), assume a simple form when 
written in polar coordinates: 





Fig. 76—Coordinate system and symbols used in the equations of bubble motion. 
Note that the angle x, as shown here, is a negative quantity because the projection of 
VHz, on the unit vector @ of the polar coordinates (r, 8) is negative. 
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i = dr/dt = —Veos(¢ — x), (36a) 
§ = d0/dt = +(V/r)sin(¢ — x), (36b) 


where x is the angle between VH, and fF a unit vector in the radial 
direction. Division of eq. (36b) by (36a) yields the velocity ratio 


r0/7 = v/v, = tan(x — 9). (37) 


Consider now the special case x = 0. This implies dH./d0 vanishes. 
Since ¢ is a constant, eq. (87) may be integrated to furnish the 
trajectory 


§@= 0. + tan ¢ln(r./r), (38) 


where r, and @, specify the position at some reference time, ¢ = ¢,. The 
path is a logarithmic spiral; it is unique in that it is independent of the 
radial behavior of H,. The dependence of H. on r will, however, 
determine the position and velocity along the spiral as a function of 
the time by inversion of the quadrature 





t=t,+sec¢ | _ (39) 


Equation (39) follows from eq. (36a). When the functional dependence 
of H, on r is sufficiently simple, eq. (39) may be integrated and 
inverted. For example, when H, = H.(r/d,)?, V = v(r) = 
(r — a dy)/t.. Here H, is a constant, a = 4H./7H, To = db/pwHo, and 
lly = 2ps is the wall mobility. Integration of eq. (39) furnishes 


r = ad, + (ro — ad»)exp(—Ad), (40) 


where A = 7.'cos ¢ and ¢, = 0. Figure 77 shows several trajectories. 
Bubble motion starts at a point r = r, marked ¢t/r. = 0. From there, 
bubbles travel along logarithmic spiral paths terminating at the origin. 
The numerals along the various paths give bubble position in units of 
t/t. for ad,/r, = 0.5. When H. = 0 and 0 S ¢ < 90°, all bubbles spiral 
into the origin arriving there as t > ™. 

Two special cases of bubble motion near a circular opening allow 
integration of eq. (35) by quadrature. For both, S = ¢ = 0. Consider a 
bubble located on the y axis at ¢ = 0. Using polar coordinates, one 
finds immediately 6 = constant since ¢ = 0 and x = 0 or 7; see eq. (36). 
The radial equation is solved by the quadrature 


r/a 
do 
=T ~~ (41) 
| | Ao) |—B 
for motion starting at the origin. Here, t = a?/podsJ, and B = 


8H.a/7dzJ, with H, in oersteds and J, in mA/um. Numerical evalua- 
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Fig. 77—Logarithmic spiral trajectories described by eq. (38). Bubbles spiral into the 
origin along these paths when H- = 0. For H. > 0, motion is along the same paths but 
bubbles do not reach the origin. The timing markers, in units of ¢/r,, show positions for 
ad,/Tro = 0.5. 


tion of eq. (41) for a = dy = 2um, H, = 1 Oe, ps = 1 pm/ps-Oe, J, = 1 
mA/um, z;/a = 0.06, and h/a = 0.8 shows that 0.70 us are required for 
motion from r = 0 to r = 0.8a. For these parameters, the characteristic 
time t equals 2 us. Arrival at r/a = 0.8 completes approximately 90 
percent of the possible radial motion. 

The second special case occurs when a bubble resides at the radial 
minimum of # at t = 0. Enter eq. (36) with ¢ = 0 and x = —7/2. Thus, 
r = re = constant, and the equilibrium value r. is found from the 
solution of /(u) = d#/du = 0 with p. = r-/a. As may be seen from Fig. 
72, re/a ~ 1 for small z;/a. Equation (36b), 


6 = V/r. = 7(cos 8 — B), (42) 
where 7 = a72/podod. | He|, Bi = 8SH-ape/tdsJo|He|, and He = H (Ue) 
is the value of Hat the equilibrium radius, has the solution 


tan(6,,/2) + ea 


tan(6,/2) — tan(@/2) oe 


= resem in| 
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for a bubble at @ = 0 when ¢ = 0. The angle 6,, is attained as t > © and 
satisfies the condition cos 6,, = £,. For the numerical values used in 
the previous example together with p. = 1.00 and | #.| = 4.56, one 
finds 6,, = 56.0° and the time needed for travel from 6 = 0 to 8 = 0.98, 
is 1.48 ps. 


10.5 Discussion 


The graphs of Figs. 65 to 69 apply to an infinite number of conducting 
stripes. As shown in these figures, #, = 0 at the origin, a conductor 
center, and rises to a maximum at the right conductor edge. The 
symmetry of #, about a conductor edge for s = 4a together with the 
periodicity and antisymmetry expressed by eq. (16) completes the 
description of #, for one period. See Fig. 8 for a typical spatial 
variation of #, over several periods. 

When the number of conducting stripes is finite, #, still has the 
wavelike behavior per period as described above. There are, however, 
essential differences. The function #, is not periodic. In addition, the 
wavelike behavior of #, is superimposed on a gradual change in the 
mean value of the bias field. This change in bias is due to the current 
I = 2an,J, flowing in the n, stripes. The magnitude of the effect can 
be estimated, using eq. (15), for a single stripe of width (nm; — 1)s + 2a. 
To illustrate the quality of the estimate, consider n, = 11 and s = 4a. 
Choose the origin at the middle of the center stripe and calculate #, 
at the center of each stripe and at the corresponding abscissa for a 
single stripe of width (n,; — 1)s + 2a = 42a. Then, for a = 1, the 
numerical results are shown in Table VI. Here, #; is the normalized 
intensity for the single stripe. In the design of current-access devices, 
the shift in local bias must be taken into account. By returning the 
drive current through a nearby stripe of comparable width, this shift 
in local bias can be reduced. 

To develop current-access propagation structures, large-scale models 
were made and the magnetic fields due to current flow in them probed 
with a coil. See Section III. One structure was a stripe pattern which 
served to calibrate the coil in the following way. The coil of radius 7, 
and height A was placed over a central stripe with the coil center 


Table VI 
Stripe 
Index Abscissa _ - 
n x/h | #5 | | 41 | 
0 0 0 0 
mast +4.0 0.367 0.385 
+2 +8.0 0.760 0.802 


+3 +12.0 1.218 1.293 
+4 +16.0 1.829 1.995 
+5 +20.0 2.928 3.582 
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directly over the stripe edge. This location permitted maximum flux 
linkage. One can now calculate the flux threading the coil from 


ro (ra 
>= | BdA= a, | Hu) du | dv, (44) 
0 0 


where the u-v coordinate system has its origin at the coil center with 
u parallel to x and v parallel to y. The integration proceeds over the 
first quadrant of the coil cross section; hence, the multiplier of 4. 
Finally, eq. (44) will give the flux in maxwells when the current density 
is expressed in A/mm and the coil radius in cm. Numerical integration 
over u utilized the parameter values r, = 0.1 in., A = 0.2 in., and z; = 
0.025 in. Using the stripe width 2a = 0.5 in., one gains entry to Fig. 66 
with h/a = 0.8 and z,/a = 0.1 to determine the values of #,(u). 
Numerical quadrature yields ¢ = (6.183J./7)7r2. The result is written 
in this form because division by zr? provides the flux density averaged 
throughout the coil volume. For a sinusoidal drive J, = Jmsin wt, the 
rms voltage induced in the coil is 


Ens = 6.183N,wr2Jm X 1078/V2 = 2.1 mV (45) 


for N; = 100 turns, w/27 = 30 kHz, and Jj, = 1/25.4 A/mm. This result 
compares favorably with the measured value of 2.0 mV. With the coil 
calibration in hand, one can assign absolute field values to voltage 
measurements from the large-scale models of proposed propagation 
structures. Apart from a slight increase in labor associated with a two- 
dimensional numerical integration over the coil cross section, one can 
also use eq. (25) for coil calibration. This has been done, and the results 
show a comparably small difference between the measured and cal- 
culated coil voltage. 


XI. MISCELLANEOUS 


This section contains items that do not seem to fit elsewhere. These 
include a three-level conductor circuit, a single level circuit with a 
lattice of etched islands, a deep implant layer for hard bubble sup- 
pression, and an analysis of bubble-bubble interactions. 


11.1 Three-level conductor circuits 


Advances in processing techniques will ultimately allow fabrication, 
at high yield, of structures with a complexity unattainable today. One 
structure that could become practical is the three-level circuit we 
discuss at this time. 

Bias shift due to current flow in a conductor and its consequences 
was covered in Section III and, in particular, in Fig. 9. By using three- 
conductor levels and returning the current of the excited level through 
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the remaining two, one can ease the bias-shift problem and utilize for 
storage the full conductor width. Another advantage is simpler elec- 
tronics; only unipolar drivers are needed. 

Two possible propagation structures are shown in Fig. 78. Circuit 
(a) has slots side by side in each of the three levels. The number of + 
and — signs indicates the relative pole strengths at the ends of the 
slots when level 1 is energized with two units of current, half of which 
returns through each of the remaining levels. The wavelike curve 
sketched below is an approximation to H, near the lower ends of the 
slots. If the levels are excited in turn with pulse currents J; in the 
sequence [---123---], the wave will move to the right and bubble 
propagation will be to the right. Interchange any two drives in the 
sequence, such as [---213---], and bubbles move left. Circuit (b) with 
in-line oval holes in the three levels generates the indicated rela- 
tive pole strengths when level 1 is excited. Here also, the sequence 


LEVEL 
77 NUMBER 
7 
1]/])2] 1/3 1 2) 13741 — |; 
+ _ 
+ + + 
H, 
(a) 
—— LEVEL NUMBER 
ee 
1 2 3 1 2 3 1 
+ bt Fe Se SE Se FOF SK Se = 


(b) 


Fig. 783—Two types of three-level circuits are shown: in-line (a) and transverse (b) 
propagation. In either case, the drive current is returned through the remaining levels 
to provide z-field cancellation. 
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[---123---] will cause bubble motion to the right, and the reversal of 
any two drives in the sequence, to the left. 


11.2 Single-level lattice-like circuits 


In most single-level conductor circuits, a rather precise fit of the 
bubble with the feature which produces the static offset field is 
required. In other words, these circuits cannot tolerate a change in 
bubble size. A lone exception is that of Dekker in which permalloy 
points driven by an in-plane field generate the offset field. Other 
alternatives are possible. From the magnetostatic equivalence of bub- 
bles and cylindrical holes etched in a garnet layer (craters), one can 
deduce that, at a distance, craters repel bubbles. Therefore, an array 
of craters can provide interstitial sites that precisely position bubbles. 
Bubble motion would then be somewhat analogous to electrons jump- 
ing from trap to trap in a semiconductor. Devices based on this 
principle are under study. 


11.3 Deep implantation for hard bubble suppression 


Data on the performance of implanted and nonimplanted films in 
dual-conductor devices were presented in Figs. 32 to 35. In particular, 
Figs. 32 and 34 point out that the minimum drive currents are higher 
when implanted films are used. This we attribute to a reduction in the 
effective drive fields because of the shielding effect of implanted layers. 
If the garnet is implanted at the garnet-substrate interface, this shield- 
ing will be eliminated. Experiments with a 2.25-um thick YSmLuCaGe 
garnet implanted at 2 x 10° with H* at 300 Kev confirmed this 
expectation. We anticipate that deep implantation will be especially 
useful if the garnet layers are less than a micron thick since the buried 
layer will precisely determine bubble height and, in addition, decouple 
bubbles from the sometimes troublesome substrate-garnet interface. 

It is known that an in-plane field can be used to convert bubbles 
into the S = 0 state; however, the field must be very large if used with 
small bubble garnets. An analysis by Okabe* suggests that the mag- 
nitude of this field will be reduced with a very thick implant layer. A 
thick implant layer is practical only if it is buried. 


11.4 Bubble-bubble interactions 


We have already stressed the point that some conductor-access 
circuits can accommodate a wide range of bubble sizes. There is no 
“gap,” in the sense of the permalloy circuits, to be overcome. In an 8- 
pum period, dual-conductor circuit, for example, bubbles ranging from 
4 ym to less than 0.8 um have been propagated. This tolerance to 
bubble size is especially useful when bubble-bubble interactions are a 
concern. 
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Bubble-bubble interaction imposes the following constraints on the 
circumstances under which a pair of adjacent bubbles in a track are 
stable in position when not driven and move in step when driven. In 
the absence of any drive excitation, the condition 


H. = 67°M,hr2/a4, (46) 


where r, is the bubble radius assures positional stability. That is, the 
mutual-interaction field gradients are just balanced by the coercivity 
H.. During propagation, the trailing bubble must be driven to overcome 
both its coercivity and an interaction gradient from the leading bubble. 
The addition of this interaction is equivalent to a doubling of H.. The 
drive current must therefore generate a gradient dH,/dx given by 


2r.0H,/dx = 16H./7. (47) 


It follows that, since J ~ dH./dx, P/bit ~ (H.ro)”. From (46), it is seen 
that r, ~ Hi”? so 


P/bit ~ H4”. (48) 


Equation (48) tells us that low-coercivity garnets are preferable to 
high-coercivity garnets since, with their use, the power dissipation will 
be reduced in spite of the interaction constraint (46). 


XI]. SUMMARY AND CONCLUSIONS 


The purpose of this paper was to introduce new, very high speed, 
conductor-access devices where apertured conducting sheets replace 
the serpentine conductors of conventional conductor devices and the 
rotating-field coils of field-access devices. When apertured sheets are 
used, the bias field perturbations are highly localized, and therefore 
the drives can be ranged over wide limits. We have also shown that 
the impedance of the conducting layers can be made to match that of 
drivers operated from low-voltage (~5V) power supplies. The design 
of single; dual, and triple level conductor circuits was discussed. 

All the functions for a shift register or a major-minor chip were 
covered, although some of the designs have not been optimized. For 
example, we have not as yet detected at a megahertz stepping rate. 
However, RF detection does look promising, and the redesign of both 
the detection electronics and the detector is in progress, and the results 
will be reported later. 

The material requirements for the current-access device technology 
have been discussed. The high-speed performance inherent in this 
technology places more emphasis on the material parameters of bubble 
mobility and dynamic coercivity. Using the (YLuSmCa)3(FeGe);012 
system as a starting point, a technique has been presented to exercise 


1536 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1979 


the bubble mobility while maintaining adequate uniaxial anisotropy. 
Compositions in the three systems, (YLuSmCa)3(FeGe);O12, (LaLu- 
SuCa)3(FeGe)sOi2, and (LaLuSm)3(FeGa)s;O0.2, have been compared 
and the limitations found in the systems discussed. A simplified film 
composition in the (LaLuSm)3(FeGa)sO.2 systems is observed that 
exhibits 1.7-um diameter bubbles along with high mobility and low 
dynamic coercivity. 

Plasma etching has been used to define circuit features in the AlCu 
films. The anisotropic etch characteristics of the plasma etch process 
have produced metal patterns dimensionally identical to the photolith- 
ographically defined etch masks. Coupled to the high material-etch 
rate selectivity, the plasma etch process appears to be ideal for pro- 
ducing the very small period devices that current-access technology 
promises. Circuits have also been successfully fabricated using Au 
conductors patterned by ion milling technology; for Al conductors, 
however, the plasma etching has been the more practical etch method. 
Thin, conformal, insulating layers of SiOz have been successfully 
deposited by plasma deposition at low substrate temperatures, and it 
appears that this process is particularly suited for fabricating conduc- 
tor-access devices. 

Current-access and field-access devices differ, of course, in many 
ways. In terms of analytical complexity, the nonlinear permalloy- 
bubble interactions of the field-access scheme have been removed in 
current-access devices because permalloy is not used. One may antic- 
ipate, therefore, simpler, more frequent, and more accurate analyses 
in this new technology. 

It is too early to conclude that conductor-access devices will become 
the technology of the next generation bubble devices. In this context, 
we include all the conductor-access devices now under development 
here and elsewhere. It is possible, for example, that the apertured 
sheet drive approach applied to the bubble-lattice file’ will make that 
structure more attractive. There are some features that only conduc- 
tor-access devices possess. If high speed 1s a necessity, as in video- 
based systems, conductor circuits are the only option available to the 
device designer at present. The anticipated small package size will be 
significant in applications for which space is a premium. The ability to 
run with logic-level power supplies is another plus for conductor 
devices. Finally, conductor circuits will require fewer precision analog 
control circuits since the critical control function to rotating field 
timing has been eliminated. 
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Errata 


F. R. K. Chung and F. K. Hwang, “On Blocking Probabilities for a Class of Linear 
Graphs,” B.S.T.J., 57, No. 8 (October 1978), pp. 2915-2925. 


We acknowledge the help of M. Horgan of England who pointed out 
an error in our paper. On page 2918, we define 
f(Bue-1)/2)) =(1- O12) Bur 721 if ¢ is even. 


The correct definition should be 


RB 7 F : 
f(Bue-v2y) = (= ay2-1erp2er241)'-) if tis even. 


F.R.K.C. 


A. N. Netravali, F. W. Mounts, and K. A. Walsh, “Adaptation of Ordering Techniques 
for Facsimile Pictures with No Single Element Runs,” B.S.T.J., 58, No. 4 (April 1979), 
pp. 857-865. 


Fig. 1 (continued), page 859, is actually Fig. 2 (continued), page 861, 
and Fig. 2 (continued) is Fig. 1 (continued). 
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A 30-GHz Scale-Model, Pyramidal, 
Horn-Reflector Antenna 


By R. A. SEMPLAK 
(Manuscript received January 24, 1979) 


1. INTRODUCTION 


In the early 1940s, the pyramidal horn-reflector antenna was in- 
vented’ at Bell Laboratories, Holmdel, New Jersey. It is now in 
extensive use in the Bell System 4-, 6-, and 11-GHz, transcontinental, 
microwave, common-carrier, radio-relay network.” This antenna is a 
combination of a square electromagnetic horn and a reflector that is a 
section of the paraboloid of revolution. The apex of the square horn 
coincides with the focus of the paraboloid. The antenna is essentially 
a shielded, offset, parabolic antenna, so that very little of the energy 
incident on the reflector is reflected back into the feed to produce an 
impedance mismatch. 

Radio interference from adjacent paths limits the number of con- 
verging routes of a common carrier microwave radio system, and in 
recent years, demands have been made to improve the sidelobe per- 
formance of the pyramidal horn-reflector antenna. In response, blind- 
ers (metallic extensions to the sidewalls of the horn) have been 
developed® which provide a degree of far sidelobe reduction, i.e., 
sidelobes beyond 35 degrees from the axis of the main beam. 

To continue the investigation for ways of reducing the sidelobe 
levels of this type antenna, a scale model was built. The model has a 
numerically-machined, precision reflector, and the scaling factor is 7.5, 
which means that measurements made at a frequency of 30 GHz will 
represent the performance of a full-size antenna measured at a fre- 
quency of 4 GHz. This paper presents and discusses the measurements 
made on the scaled model at 30 GHz. Comparisons are made with data 
obtained (at a frequency of 4 GHz) by others on full-sized antennas.‘ 
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In the discussion that follows, it should be remembered (from Ref. 
4) that longitudinal polarization and longitudinal plane indicate that 
the electric field in the aperture and the plane of antenna rotation, for 
radiation measurements, are aligned with the pyramidal horn axis, 
whereas transverse polarization and transverse plane indicate that the 
electric field in the aperture and plane of antenna rotation are perpen- 
dicular to the horn axis.* 


li. DISCUSSION 


A study of the historical data on the radiation characteristics of the 
horn-reflector antenna indicates that, since its inception in the 1940s, 
disagreements between measured data and theoretical values have 
existed. For example, in the transverse plane with longitudinal polari- 
zation, agreement between measured data and theoretical value exists 
only out to the first sidelobe. With transverse polarization, good 
agreement extends out to the fourth sidelobe. 

Since the aperture of the horn-reflector antenna is illuminated with 
a dominant waveguide mode, the theoretically obtainable off-axis 
radiation levels‘ in both the transverse planes for transverse polariza- 
tion are considerably higher than those obtained for longitudinal 
polarization in the transverse plane and transverse polarization in the 
longitudinal plane. For transverse polarization in the transverse plane, 
one has essentially the equivalent of an aperture with constant illu- 
mination across it. Hence, with the antenna mounted on its normal 
vertical position (i.e., the axis of the horn normal to the earth), the 
amplitude distributions across the aperture of the antenna are as 
follows: For longitudinal polarization, the electric field in the longitu- 
dinal direction is uniformly distributed across the aperture, and in the 
transverse direction the field is tapered to a low value at the edges; 
with transverse polarization, the electric field is uniform across the 
aperture in the transverse direction and is tapered to a low value at 
the edges in the longitudinal direction. 

Of the many various possible contributors to high sidelobes, the two 
strongest contenders are: (1) higher-order modes being generated in 
the existing feedhorn, which was designed to accommodate the over- 
sized circular waveguide} used in the system, and (iz) surface toler- 


* As used in the microwave radio relay system, the horn-reflector antenna is mounted 
with the axis of the horn normal to the earth’s surface. Hence, longitudinal and 
transverse polarizations could be called vertical and horizontal, respectively. However, 
the aperture field distribution for each polarization is different, and when the antenna 
is used as an earth station antenna for satellite communications, or as a radiometer, or 
simply to obtain radiation patterns in the longitudinal plane (the antenna is mounted on 
its side), aperture field distributions for so-called vertical and horizontal are now 
interchanged. To avoid this ambiguity, longitudinal and transverse polarizations which 
are referred to the axis of the horn are used. 

+ Higher-order modes can be excited in oversized waveguide; hence to avoid this 
condition when obtaining radiation patterns, one uses a transition from feedhorn to 
waveguide transducer. 
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ances of the parabolic reflector. To examine the first contender, two 
feedhorns were made, one a scaled version of the existing system 
feedhorn and the second a new design. This new feedhorn terminates 
the taper of the horn into the walls of rectangular K-band waveguide, 
thus ensuring the presence of only the dominant mode. In a sense, this 
is achieved by opening the parallel walls of the waveguide at a constant 
angle and extending them until they form the square aperture needed 
to mate with the rest of the antenna. This, of course, requires that the 
walls normal to the electric field open first. Carefully machined man- 
drels were made of both feeds over which copper was electroformed to 
produce the final models. An examination of the radiation character- 
istics of the scale-model antenna obtained from using both feeds 
indicated that the characteristics were essentially identical. Hence, the 
characteristics contained in the figures to be discussed were obtained 
using the scale version of the system feedhorn. In the figures, the 
theoretical values* are shown by a dashed line, and the measured data 
of the full-sized horn-reflector antenna by a broken line. The measured 
data for the scale-model are shown by the solid line. 

For the transverse plane, Figs. 1 and 2 show the measured charac- 
teristics for longitudinal and transverse polarizations, respectively. As 
indicated here, the agreement between scale-model measurements and 
theory is remarkably good. From these two figures, one can see the 
improvement in sidelobe level by comparing the results with the 
broken line curve which indicates the measured response for the full- 
sized antenna. For example, in Fig. 1 the improvement is about 8 dB 
and increasing in the region of the fourth sidelobe and beyond, i.e., 
beyond 8 degrees from the on-axis position. In Fig. 2, a similar 
comparison shows a 4-dB improvement beginning around 10 degrees. 

In the longitudinal plane, the measurements for longitudinal and 
transverse polarizations were obtained for the scale-mode antenna but 
are not included here since these patterns, like those of the full-size 
antenna, are in very good agreement with theory. Similarly, the cross- 
polarization patterns obtained on the scale model were like those of 
the full-sized antenna and, since both agree well with theory, are also 
not included here. 

It is evident from these figures that the scale-model measurements 
are in remarkable agreement with theory. The presumption that the 
high sidelobe levels of the full-size antenna were due in part to high- 
order modes being generated by the feedhorn is not valid. This is 
emphatically demonstrated by the agreement with theory which does 
not accommodate the presence of higher-order modes. The remaining 
strong contender for these high sidelobe levels is the accuracy of the 
full-size parabolic reflector surface. An examination of the full-size 
structure shows the surface to consist of two half skins fastened to 
parabolic back ribs by means of 262 (No. 14, Type B) sheet metal 
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Fig. 1—Radiation pattern in the transverse 
measurements obtained on the full-size antenna 
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Fig. 2—Radiation pattern in the transverse plane for transverse polarization. 
measurements obtained on the full-size antenna. 
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screws and 120 (3/16-inch diameter Universal Head) rivets. Since 
these fasteners are distributed in a regular fashion, one could envision 
them as forming an array of scatterers which produce a low level, very 
broad beam and enhance the far sidelobe region. An investigation of 
this remaining aspect is not considered in the immediate future since 
it would entail a reproduction of these discontinuities on the precise 
surface of the scale-mode. Also, there are many interesting experiments 
waiting to be done on this scale model which can now serve as a test 
bed. 


Ill. CONCLUSIONS 


Data obtained on a scale-model version of the full-sized horn-reflec- 
tor (used in the 4-, 6-, and 11-GHz microwave common-carrier, radio 
relay system) have been presented. From the data, one once again 
observes how well scale models depict the full-size world. But of most 
importance was the achievement of agreement between measurement 
and theory which produced the key finding that past disagreement 
between theoretical results and experimental data for a full-size an- 
tenna in the sidelobe region could not be due primarily to higher order 
modes being excited in the feedhorn, but could be attributed to surface 
imperfections of the reflector. 
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